GENE FOR ATAXIA



The present invention relates to the isolation, identification and characterization of a
newly identified human gene responsible for disorders relating to ataxia, as well as the
diagnosis and therapy of such disorders.

Ataxia in general describes the inability of a patient to properly coordinate his or her 
movements and is caused by neuronal defects. The autosomal dominant forms mostly 
represent neurodegenerative disorders and are characterized by the selective loss of
neurons within the brainstem and spinocerebellar tracts. Recessive forms of ataxia are 
often based on defects in neuronal cell communication, such as defective ion channels 
(Doyle and Stubbs, 1998). 

Up to now, 8 types of dominantly inherited ataxia have been genetically defined:
Spinocerebellar Ataxia types 1-7 (SCA1-7) and Dentato-Rubral Pallido-Luysian Atrophy
(DRPLA). Recent years have seen rapid progress in identifying the underlying genes. The
first gene for hereditary ataxia, SCA1, was discovered in 1993, and the gene for SCA7 was
identified in 1996. The genetic mutations of SCA1, 2, 3, 6 and 7 have now been identified.
For SCA4 and SCA5 the responsible genes have not yet been identified. For the SCAs with
a known genetic mutation, individual genetic diagnosis and presymptomatic testing are
possible.

Thus far, all known mutations for the different SCAs and for DRPLA consist of an
unstable expansion of a CAG repeat in one of the gene's exons (reading frames). 'CAG'
stands for cytosine, adenine and guanine, the three DNA bases which together
form a 'triplet'. Dominant hereditary ataxias are not the only disorders caused by an
expanded triplet repeat. Other disorders caused by an expanded triplet repeat include
Huntington's disease, Friedreich's ataxia and myotonic dystrophy.


In the SCAs, DRPLA, and some other inherited diseases such as Huntington's disease, a
normally present CAG triplet repeat is expanded and has become unstable. Translated
into a protein, this expanded CAG triplet repeat results in an expanded tract of identical
glutamine amino acids (a polyglutamine tract). Once the gene has been identified, the
gene products, the proteins, can be studied. With one exception, SCA6, the proteins are
previously unknown, novel proteins with unknown functions. For the SCAs, the novel
proteins are named 'ataxin' with the corresponding SCA number, e.g. ataxin-1. Similarly,
for DRPLA the protein is called 'atrophin', for Huntington's disease the protein is called
'huntingtin' and for Friedreich's ataxia the protein is called 'frataxin'.
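The relationship between an in-frame CAG repeat and the polyglutamine tract it encodes can be illustrated with a minimal Python sketch. The two sequences and the helper names (`longest_cag_run`, `translate`) are hypothetical and chosen only for illustration; they are not taken from any SCA gene, and the codon table is deliberately partial.

```python
# Minimal sketch: an in-frame CAG repeat translates into a polyglutamine tract.
# The example sequences are invented for illustration only.

# Partial codon table, sufficient for the toy sequences below.
CODON_TABLE = {"ATG": "M", "CAG": "Q", "CAA": "Q", "GCC": "A", "TAA": "*"}


def longest_cag_run(dna: str) -> int:
    """Return the longest run of consecutive CAG codons in reading frame 0."""
    best = run = 0
    for i in range(0, len(dna) - 2, 3):
        if dna[i:i + 3] == "CAG":
            run += 1
            best = max(best, run)
        else:
            run = 0
    return best


def translate(dna: str) -> str:
    """Translate reading frame 0; codons missing from the table become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


normal = "ATG" + "CAG" * 6 + "GCC" + "TAA"     # hypothetical short repeat
expanded = "ATG" + "CAG" * 12 + "GCC" + "TAA"  # hypothetical expanded repeat

print(longest_cag_run(normal), translate(normal))
print(longest_cag_run(expanded), translate(expanded))
```

Each additional CAG codon adds exactly one glutamine (Q) to the translated tract, which is why repeat length and polyglutamine length are used interchangeably in the discussion above.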

SCA6 is the only Spinocerebellar Ataxia in which the function of the gene and protein is
known. The protein in SCA6 is part of a voltage-gated calcium channel in the cellular
membrane, which plays a role in cellular excitation. Other mutations in the gene for
SCA6 can cause two other (allelic) disorders: hemiplegic migraine and episodic ataxia
type II.

Combining the clinical information on the different Spinocerebellar Ataxias, some
correlations become apparent. Within groups of SCA1, SCA2, SCA3, and SCA7
patients, the length of the CAG repeat shows a strong inverse correlation with the age at
onset of symptoms, and also with progression of the disease. Thus, people with a later
onset of symptoms usually have a smaller expansion of the CAG repeat. However, at
any repeat length there is a rather wide variation in age at onset, of some twenty years or
so. Thus, in a single individual the repeat length does not enable prediction of the age at
onset or progression. Although essentially unchanged during an individual's lifetime, the
CAG repeat is not stable during parent-to-child transmission, and may thus differ
between parents and children. Especially in paternal transmission, the expansion often
increases. This in part explains anticipation, the phenomenon that the mean age at onset
is earlier in subsequent generations. In maternal transmission the repeat expansion is
usually more stable.

Clinical manifestations vary considerably within each genetically defined type, and with
the exception of SCA7 and SCA6, only a few global characteristics may be discerned for
the SCAs, precluding an individual clinical diagnosis.

Research on histological and pathological aspects has focused mainly on the mechanism
which causes cell death and ataxia. First, the presence of ataxin has been studied. As
SCAs are dominantly inherited disorders, both a normal and a mutated SCA gene are
present, and so both normal and mutated ataxin are expressed. It must be realised that
genes are not active all the time, and that genes are turned on or off by signals according
to cellular activity and specialisation. In the case of SCA1 and SCA2, for instance, it
appears that ataxin-1 and ataxin-2 are present in many tissues and cells, and that the
levels of ataxin do not correlate well with the presence of localised neurological cell
death.

To explain the discrepancy between the presence of (mutated) ataxin and cell
degeneration, interaction with other, possibly cell-specific proteins has been suggested,
and indeed, in Huntington's disease and various SCAs, aggregation with proteins has
been demonstrated. Also, a tendency has been demonstrated for mutated ataxins to
(self-)aggregate, and proteins containing an expanded polyglutamine stretch have been
found to be toxic for cultured cells.

Last year it was reported that in Huntington's disease and in SCA3 or Machado-Joseph
Disease, certain brain cells showed inclusion bodies within the cell's nucleus. These
inclusion bodies contained partially degraded molecules of mutated huntingtin and
ataxin. Normally, huntingtin and ataxin are present in the cytoplasm of these cells, that
is the fluid surrounding the nucleus, but not within the nucleus. The presence of
intranuclear inclusion bodies seemed to correlate with degeneration of these neurons. As
these inclusion bodies also contained ubiquitin, a molecule that marks proteins for
degradation, they seem to indicate that the protein aggregate which they contain is
resistant to degradation.

Friedreich's ataxia is the most frequent recessive ataxia occurring among white
populations, but there are other recessive ataxias as well. They include: a)
Ataxia-Telangiectasia (AT), an early onset ataxia with dilated venules and
telangiectasia. AT is almost as frequent as Friedreich's ataxia. b) Familial
Vitamin E Deficiency, of which there are two forms, with or without fat malabsorption.
The form without fat malabsorption is more common in the North African population,
but rare everywhere else. Vitamin E supplementation is an effective treatment. Further
known ataxias are c) Refsum Disease (ataxia with neuropathy and retinitis), d) Infantile
Onset Spinocerebellar Ataxia, or IOSCA, a Finnish disease unknown in other world
populations, and e) Spastic Ataxia of Charlevoix-Saguenay, which is found mostly in
Quebec, Canada. There exist further rare ataxias that do not have a name because they
are not well characterised.

In recessive ataxias, patients carry two copies of the ataxia defect or mutation, one
received from each parent. The parents of the patient are not affected because they carry
only one copy. For all recessive diseases, the frequency of healthy carriers is much higher
than the frequency of patients, because the risk that two carriers of the same ataxia
mutation marry is relatively low. For example, for Friedreich's ataxia, the frequency of
healthy carriers is 1 in 120 while the frequency of patients is 1 in 40,000. Almost all
Friedreich's ataxia carriers and patients have the same ancestor (who lived most likely
more than 10,000 years ago). This explains why Friedreich's ataxia is more prevalent in
white populations and is almost non-existent among Japanese and black African
populations.
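The relation between carrier and patient frequencies for a recessive disorder can be checked with a short calculation. The sketch below assumes Hardy-Weinberg equilibrium (random mating and a single disease allele of frequency q), an assumption the text does not state; the function names are illustrative only.

```python
import math


def patient_frequency(carrier_freq: float) -> float:
    """Expected frequency of affected individuals (q^2) for a given
    carrier frequency 2q(1 - q), assuming Hardy-Weinberg equilibrium."""
    # Solve 2q(1 - q) = carrier_freq for the smaller root q.
    q = (1 - math.sqrt(1 - 2 * carrier_freq)) / 2
    return q * q


def carrier_frequency(patient_freq: float) -> float:
    """Expected carrier frequency 2q(1 - q) for a given patient frequency q^2."""
    q = math.sqrt(patient_freq)
    return 2 * q * (1 - q)


# Figures quoted above for Friedreich's ataxia.
print(f"carriers 1 in 120    -> patients about 1 in {1 / patient_frequency(1 / 120):,.0f}")
print(f"patients 1 in 40,000 -> carriers about 1 in {1 / carrier_frequency(1 / 40000):,.0f}")
```

Under these assumptions a carrier frequency of 1 in 120 corresponds to a patient frequency in the range of 1 in 50,000 to 60,000, and a patient frequency of 1 in 40,000 to a carrier frequency of roughly 1 in 100. The two quoted figures are therefore of the same order of magnitude rather than exactly interchangeable, which is what one would expect from rounded population estimates.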

The genetic defect in the gene leads to a lower expression of the protein called frataxin,
which is expressed about 10 to 20 times less than in healthy
persons. The important point is that a small amount of frataxin is still made. This
distinguishes the expansion mutation from other mutations, referred to as truncating
mutations and frequently found in other recessive diseases, which completely disrupt the
production of the protein. Frataxin is a protein of the mitochondria, the structures of a
cell where its fuel is produced from food. This energy conversion requires electron
transport with iron-containing proteins. Frataxin is present in tissues that are rich in
mitochondria, which include the tissues that are affected in patients (dorsal root ganglia
of the spinal cord, heart). Frataxin is present in every living organism, including fungi
and yeast, because they also contain mitochondria. In yeast, complete absence of frataxin
causes iron accumulation within the mitochondria. In Friedreich's ataxia patients, a
similar defect is also likely to apply, but to a lesser extent since there is still some
residual frataxin present. Evidence for this comes from iron deposits and defects of some
iron-containing proteins in the hearts of patients. Mitochondrial iron accumulation
therefore seems to be the culprit, since excess iron is well known to stimulate the
production of toxic compounds called free radicals or reactive oxygen species.

The present invention relates to the isolation and characterization of genes involved in 
constitution and regulation of the vertebrate nervous system, which is a major step 
towards elucidating the complex interactions required for an intact nervous system. The 
gene was identified via a positional cloning approach in the course of which the
cytogenetic breakpoints of a patient suffering from severe cerebellar ataxia had been 
studied. 

In order to investigate the breakpoint on the long arm of the X-chromosome, several
YAC, PAC and cosmid clones (Vetrie et al., 1994) were chosen covering the region of
CEPHy904G09223. DNA of these clones was used for FISH analysis on metaphase
spreads of the patient. It could be demonstrated that the PAC clone 1055C14 covers the
breakpoint. The cosmids in close vicinity to the breakpoint-spanning PAC had already
been sequenced by the Sanger Center in Cambridge.

Sequence analysis of cosmid 9N5 allowed the design of appropriate primers for certain
interesting regions. This allowed the amplification of parts of the potential new gene
using a fetal brain cDNA library. The first seven exons of the gene could be identified.
In order to obtain a full length cDNA clone, RACE experiments were performed and the
resulting fragments were subcloned into a TOPO vector (Invitrogen) and sequenced.

The complete cDNA has a coding region of 1251 bp encoding 417 amino acids and a
3' untranslated region of 348 bp. The gene consists of nine exons ranging in size from
68 bp to 224 bp and is orientated from telomere (5' end) to centromere (3' end) (fig. 3).
Exons 8 and 9 as well as the 3'UTR represent novel sequence data. The sequence as well
as the translation into the potential protein are given in SEQ ID No. 1 and SEQ ID No.
2.

With respect to patient T.G., it was important to show whether the chromosomal
breakpoint had affected this new gene. For this purpose all genomic fragments
containing exons of the ataxia gene were isolated from PAC 1055C14, pooled and used
for FISH analysis on metaphase spreads of the patient's mother. The normal
X-chromosome showed a signal in Xq22 plus a cross-hybridization to Xq28. The
rearranged X-chromosome showed a strong signal in Xp22, the cross-hybridization
signal and a weak signal in Xq22. This shows clearly that the breakpoint of the patient
resides within the genomic locus of the ataxia gene.

To further verify this observation, Southern blot analysis was performed, using the
entire ataxia cDNA as a probe. When EcoRV was used as restriction enzyme, a band shift
was observed in the patient, but not in healthy controls. This suggests that this
novel gene is indeed affected by the chromosomal breakage that has occurred in the
patient and might therefore contribute to the phenotype seen in the patient.

However, the gene responsible for the correct mediation of neuronal signalling can also
be used for the diagnostic determination of other neurological diseases of as yet
unknown etiology, such as spastic paraplegia mutations in PLP, Pelizaeus-Merzbacher
disease, diseases related to the vertebrate central nervous system and various other
neurological diseases of as yet unknown etiology.

The isolated genomic DNA or fragments thereof can be used for pharmaceutical
purposes or as diagnostic tools or reagents for identification or characterization of the
genetic defect involved in the disorders and diseases mentioned above.



Subject of the present invention are further ataxia proteins which are expressed after
transcription of the ataxia gene into RNA or mRNA and which can be used in the
therapeutic treatment of disorders related to mutations in said genes. The invention
further relates to appropriate cDNA sequences which can be used for the preparation of
recombinant proteins suitable for the treatment of such disorders.


Subject of the invention are further plasmid vectors for the expression of the DNA of
these genes and appropriate cells containing such DNAs. It is a further subject of the
present invention to provide means and methods for the genetic treatment of such
disorders in the area of molecular medicine using an expression plasmid prepared by
incorporating the nucleic acid molecules of this invention downstream from an
expression promoter which effects expression in a mammalian host cell.

Brief Description of the SEQ ID:

SEQ ID NO. 1: Sequence of ataxia cDNA. Exon/intron junctions are at the following
positions: 72/73 (exon 1; size: 72 bp); 202/203 (exon 2; size: 130 bp); 270/271
(exon 3; size: 68 bp); 494/495 (exon 4; size: 224 bp); 575/576 (exon 5; size: 81
bp); 718/719 (exon 6; size: 143 bp); 933/934 (exon 7; size: 215 bp); 1083/1084
(exon 8; size: 150 bp); 1252 ATG stop codon (exon 9; size: 169 bp).

SEQ ID NO. 2: ataxia protein

SEQ ID NO. 3: exon 8 with neighbouring genomic sequences of the ataxia gene.
Direction is from 5' towards the 3' end.

SEQ ID NO. 4: amino acid sequence for exon 8

SEQ ID NO. 5: exon 9 with neighbouring genomic sequences of the ataxia gene.
Direction is from 5' towards 3' end.

SEQ ID NO. 6: part of the genomic sequence as published by the Sanger Center
comprising exons 1 - 7. Exon/intron boundaries are at the following positions of
the sequence: exon 1: 29850 - 29921 (72 bp); exon 2: 33026 - 33155 (130 bp);
exon 3: 33445 - 33514 (68 bp); exon 4: 33752 - 33975 (224 bp); exon 5: 34115 -
34195 (81 bp); exon 6: 35760 - 35901 (143 bp); exon 7: 38782 - 38996 (215 bp).
With respect to the published complementary strand, the exon/intron boundaries
(5'-boundary to 3'-boundary) are at the following positions of the sequence: exon
1: 9946 - 9875 (72 bp); exon 2: 6770 - 6641 (130 bp); exon 3: 6351 - 6282 (68
bp); exon 4: 6044 - 5821 (224 bp); exon 5: 5681 - 5601 (81 bp); exon 6: 4036 -
3895 (143 bp); exon 7: 1014 - 800 (215 bp).
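The exon figures listed above can be cross-checked arithmetically. The following Python sketch is only a consistency check on the numbers as printed in this description; the variable and function names are illustrative, and inclusive coordinate counting (end - start + 1) is an assumption about how the genomic intervals were meant to be read.

```python
# Exon sizes (bp) as listed for SEQ ID NO. 1.
EXON_SIZES = [72, 130, 68, 224, 81, 143, 215, 150, 169]

# (start, end, stated size in bp) for exons 1-7 as listed for SEQ ID NO. 6.
GENOMIC_EXONS = [
    (29850, 29921, 72), (33026, 33155, 130), (33445, 33514, 68),
    (33752, 33975, 224), (34115, 34195, 81), (35760, 35901, 143),
    (38782, 38996, 215),
]


def junctions(sizes):
    """Cumulative end position of each exon within the cDNA."""
    ends, total = [], 0
    for size in sizes:
        total += size
        ends.append(total)
    return ends


def size_mismatches(exons):
    """Exon numbers whose inclusive span (end - start + 1) differs from the stated size."""
    return [
        number
        for number, (start, end, size) in enumerate(exons, start=1)
        if end - start + 1 != size
    ]


print(junctions(EXON_SIZES))           # cumulative exon ends in the cDNA
print(divmod(1251, 3))                 # codons in the 1251 bp coding region
print(size_mismatches(GENOMIC_EXONS))  # exons whose span and stated size disagree
```

The cumulative sizes reproduce the junction positions given for SEQ ID NO. 1 (72, 202, 270, 494, 575, 718, 933, 1083, 1252), and 1251 bp divides evenly into 417 codons. For SEQ ID NO. 6, the intervals printed for exons 3 and 6 span 70 bp and 142 bp under inclusive counting, against stated sizes of 68 bp and 143 bp. Whether that reflects a slip in the listing or a different counting convention cannot be determined from the text.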


Brief Description of the figures: 

Fig. 1: Schematic presentation of the genomic organization of the gene. Exons 1 - 7
were already sequenced by genome-wide sequencing efforts. Exons 8 and 9 as
well as the 3'UTR have been sequenced according to the present invention.

Since the target gene leading to ataxia was unknown prior to the present invention, the
biological and clinical association of patients with this deletion could give insights into
the function of this gene. In the present study, fluorescence in situ hybridization (FISH)
was used to examine metaphase and interphase lymphocyte nuclei of patients.

Subject of the present invention are therefore DNA sequences or fragments thereof
which are part of the genes responsible for ataxia. DNA sequences or fragments of this
gene, as well as the respective full length DNA sequence of this gene, can be
inserted into an appropriate vector and transfected into cells. When such vectors are
introduced into cells in an appropriate way, so that the sequences are present as in
healthy humans, it is conceivable to treat patients suffering from various types of ataxia
by modern means of gene therapy. For example, ataxia can be treated by removing the
respective mutated gene. It is also possible to stimulate the respective genes which
compensate for the action of the genes responsible for ataxia, i.e. by inserting DNA
sequences before, after or within the ataxia gene in order to increase the expression of
the healthy alleles. By such modifications of the genes, the ataxia gene becomes
activated or silenced, respectively. This can be accomplished by inserting DNA
sequences at appropriate sites within or adjacent to the gene, so that these inserted DNA
sequences interfere with the ataxia gene and thereby activate or prevent its
transcription. It is also conceivable to insert a regulatory element (e.g. a promoter
sequence) before said ataxia gene to stimulate the gene to become active. It is further
conceivable to stimulate the respective promoter sequence in order to overexpress the
healthy functional allele. The modification of genes can generally be achieved by
inserting exogenous DNA sequences into the ataxia gene via homologous recombination.

The DNA sequences according to the present invention can also be introduced into
animals, such as mammals, via an appropriate vector system. These transgenic animals
can then be used for in vivo investigations for screening or identifying pharmaceutical
agents which are useful in the treatment of diseases involving defects of the ataxia gene.
If the animals respond positively to the administration of a candidate compound or
agent, such agent or compound or derivatives thereof would be conceivable as
pharmaceutical agents. By appropriate means, the DNA sequences of the present
invention can also be used in genetic experiments aimed at finding methods to
compensate for the loss of genes responsible for ataxia (knock-out animals).

In a further object of this invention, the DNA sequences can also be
transformed into cells. These cells can be used for identifying pharmaceutical agents
useful for the treatment of diseases involving ataxia, or for screening such
compounds or libraries of compounds, especially using high-throughput screening systems.
In an appropriate test system, variations in the phenotype or in the expression pattern of
these cells can be determined, thereby allowing the identification of interesting
candidate agents in the development of pharmaceutical drugs.

The DNA sequences of the present invention can also be used for the design of
appropriate primers which hybridize with segments of the ataxia gene or fragments
thereof under stringent conditions. Appropriate primer sequences can be constructed
which are useful in the diagnosis of people who have a genetic defect causing ataxia.
Such primers comprise nucleic acid sequences having a length of 10 - 40 nucleotides,
preferably 15 - 30 nucleotides. Especially preferred primers are given in example 3.

In general, DNA sequences according to the present invention are understood to
embrace also such DNA sequences which have at least 80 %, preferably at least 90 %,
or at least 95 % or 98 % sequence identity to the specific DNA sequences according to
the present invention. This also includes such nucleic acid sequences which are
degenerate to the specific sequences shown, based on the degeneracy of the genetic
code, or fragments thereof which hybridize under stringent conditions with the
specifically shown DNA sequences. Hybridisation is to be understood to take place at
high stringency conditions using Church buffer (0.5 M NaPi pH 7.2, 7% SDS, 1 mM
EDTA) at 65°C and by washing in 40 mM NaPi, 1 % SDS at 65°C.
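The description does not state how sequence identity is to be calculated. As a minimal sketch, assuming the two sequences have already been aligned to equal length (with `-` marking gaps) and that identity is taken over all alignment columns, a percentage can be computed as follows. The function name and the example sequences are illustrative and not taken from the sequence listing.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of alignment columns with identical residues.

    Assumes `a` and `b` are pre-aligned to equal length; columns containing
    a gap ('-') are counted as mismatches.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        raise ValueError("empty alignment")
    matches = sum(
        1 for x, y in zip(a.upper(), b.upper()) if x == y and x != "-"
    )
    return 100.0 * matches / len(a)


# Hypothetical 10-nucleotide alignment with two mismatches.
identity = percent_identity("ATGACAACTC", "ATGACTACTG")
print(f"{identity:.0f} % identity; meets 80 % threshold: {identity >= 80}")
```

Other conventions exist, for example dividing by the length of the shorter sequence or excluding gapped columns, and they can give different percentages for the same pair of sequences.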

In principle, all oligonucleotide primers and probes for amplifying and detecting a
genetic defect responsible for ataxia in a biological sample are suitable for amplifying a
target ataxia-associated sequence. In particular, suitable exon-specific primer pairs can be
provided. Subsequently, a suitable detection, e.g. using a radioactive or non-radioactive
label, is carried out.

Also, a single-stranded RNA can be used as target. Methods for reverse transcribing
RNA into cDNA are well known and described in Sambrook et al., Molecular
Cloning: A Laboratory Manual, New York, Cold Spring Harbor Laboratory 1989.
Alternatively, preferred methods for reverse transcription utilize thermostable DNA
polymerases having RT activity.

Further, the technique described above can be used for selecting those persons from a
group of persons of short stature who are characterized by a genetic defect, which
as a consequence allows a more specific medical treatment.

In another subject of the present invention, the ataxia proteins can be used as
pharmaceutical agents. These proteins initiate a still unknown cascade of biological
effects on a molecular level involved with ataxia. They can be used in the treatment of
ataxia and related diseases based on the same etiology. Within the meaning of the
present invention, the term "ataxia protein" also comprises proteins and functional
fragments thereof which have a similar or comparable physiological effect to the ataxia
protein described in SEQ ID NO. 2. Such proteins comprise modifications, deletions,
substitutions or variations of one or more amino acids, whereby the resulting variant has
a homology of at least 80 %, preferably at least 90 %, 95 % or 99 % to the protein
described in SEQ ID NO. 2. The ataxia proteins have a length of 300 - 600 amino acids,
preferably 350 - 500 amino acids, and most preferably 400 - 450 amino acids.

As used herein, the term "isolated" refers to the original derivation of the DNA
molecule by cloning. It is to be understood, however, that this term is not intended to be
so limiting and, in fact, the present invention relates to both naturally occurring and
synthetically prepared sequences, as will be understood by the person skilled in the art.

The DNA molecules of this invention may be used in forms of gene therapy involving
the use of an expression plasmid prepared by incorporating an appropriate DNA
sequence of this invention downstream from an expression promoter that effects
expression in a mammalian host cell. Suitable host cells are procaryotic or eucaryotic
cells. Procaryotic host cells are, for example, E. coli, Bacillus subtilis, and the like. By
transfecting host cells with replicons originating from species adaptable to the host, that
is, plasmid vectors containing a replication starting point and regulator sequences, these
host cells can be transfected with the desired gene or cDNA. Such vectors are preferably
those having a sequence that provides the transfected cells with a property (phenotype)
by which they can be selected. For example, for E. coli hosts the strain E. coli K12 is
typically used, and for the vector either pBR322 or pUC plasmids can generally be
employed. Examples of suitable promoters for E. coli hosts are the trp promoter, lac
promoter or lpp promoter. If desired, secretion of the expression product through the
cell membrane can be effected by connecting a DNA sequence coding for a signal
peptide sequence at the 5' upstream side of the gene. Eucaryotic host cells include cells
derived from vertebrates, yeast, etc. As a vertebrate host cell, COS cells can be used
(Cell, 1981, 23: 175 - 182), or CHO cells. Preferably, promoters can be used which are
positioned 5' upstream of the gene to be expressed and which have RNA splicing
positions, polyadenylation and transcription termination sequences.

The present invention is illustrated by the following examples. 


Example 1 
Patient 

Patient T.G. is a ten-year-old boy suffering from slight mental retardation and severe
cerebellar ataxia. An MRI scan showed severe infra- and supratentorial hypo- and
dysmyelinisation. Cytogenetically, the boy, as well as his healthy mother, exhibits an
inversion of the X-chromosome with the breakpoints residing in Xp22 and Xq22,
respectively.


Example 2 
Identification of the ataxia gene 

a) Fluorescence in situ hybridization (FISH)
FISH studies using appropriate cosmids were carried out according to published
methods (Lichter and Cremer, 1992). In short, one microgram of the respective cosmid
clone was labeled with biotin and hybridized to human metaphase chromosomes under
conditions that suppress signals from repetitive DNA sequences. The
hybridization signal was detected via FITC-conjugated avidin. Images of FITC were
taken using a cooled charge-coupled device camera system (Photometrics, Tucson, AZ).


b) Physical mapping 

In order to investigate the breakpoint on the long arm of the X-chromosome, several
YAC, PAC and cosmid clones (Vetrie et al., 1994) were chosen covering the region of
CEPHy904G09223. DNA of these clones was used for FISH analysis on metaphase
spreads of the patient. It could be demonstrated that the PAC clone 1055C14 covers the
breakpoint. The cosmids in close vicinity to the breakpoint-spanning PAC had already
been sequenced by the Sanger Center in Cambridge.

c) Southern Blot Hybridisation 

Southern blot hybridisations were carried out at high stringency conditions in Church
buffer (0.5 M NaPi pH 7.2, 7% SDS, 1 mM EDTA) at 65°C and washed in 40 mM NaPi,
1% SDS at 65°C.

d) FISH Analysis 

Biotinylated cosmid DNA (insert size 32 - 45 kb) or cosmid fragments (10 - 16 kb) were
hybridised to metaphase chromosomes from stimulated lymphocytes of patients under 
conditions as described previously (Lichter and Cremer, 1992). The hybridised probe 
was detected via avidin-conjugated FITC. 

e) PCR Amplification

All PCRs were performed in 50 µl volumes containing 100 pg-200 ng template, 20
pmol of each primer, 200 µM dNTPs (Pharmacia), 1.5 mM MgCl2, 75 mM Tris/HCl
pH 9, 20 mM (NH4)2SO4, 0.01% (w/v) Tween 20 and 2 U of Goldstar DNA Polymerase
(Eurogentec). Thermal cycling was carried out in a Thermocycler GeneE (Techne).


f) Exon Amplification 

For exon amplification experiments, the methods as previously described in Church et 
al., 1994 were used.

Example 3

Primers for the amplification of ataxia exons

Exons 2 and 3 as well as exons 4 and 5 were amplified in one PCR reaction, as the
introns are rather small.

GLRA4 Exon 1 For : 5'-ATG ACA ACT CTT GTT CCT GCA ACC CTC TCC-3'
GLRA4 Exon 1 Rev : 5'-TGG CTA GTG TTT GCA TGC ACC-3'
PCR conditions : 30" 94°C, 30" 58°C, 30" 72°C, 38 cycles

GLRA4 Exon 2 For : 5'-GCA CAT AAC TGG CCT CAG ACT T-3'
GLRA4 Exon 3 Rev : 5'-CAC TCA ACA TAG GCT GGA AGT-3'
PCR conditions : 30" 94°C, 30" 60°C, 45" 72°C, 38 cycles

GLRA4 Exon 4 For : 5'-CCT GAG ATG TGT TCC CAA CAT-3'
GLRA4 Exon 5 For : 5'-CCA GTA AGC CGA TGT CAC TTC-3'
GLRA4 Exon 5 Rev : 5'-GTT ATT CCA GGC TCT CTG TGA-3'
PCR conditions : 30" 94°C, 30" 59°C, 30" 72°C, 38 cycles

GLRA4 Exon 6 For : 5'-CAG CAT CCA TAC TCT GCA GC-3'
GLRA4 Exon 6 Rev : 5'-AGG TTC TCC TGT GGC TCA CA-3'
PCR conditions : 30" 94°C, 30" 60°C, 30" 72°C, 38 cycles

GLRA4 Exon 7 For : 5'-TCA GGC TCA GCT ACA GGC TG-3'
GLRA4 Exon 7 Rev : 5'-GGT ACT CTA TGG CAA GCA AGT T-3'
PCR conditions : 30" 94°C, 30" 58°C, 30" 72°C, 38 cycles

GLRA4 Exon 8 For : 5'-GTG TCC TAC GTG AAG GCA AT-3'
GLRA4 Exon 8 Rev : 5'-TCC AAG CGT TGG CGC CTC T-3'
PCR conditions : 30" 94°C, 30" 58°C, 30" 72°C, 38 cycles

GLRA4 Exon 9 For : 5'-CCA AGG CAC CTT GTC TGC ATA ACA-3'
GLRA4 Exon 9 Rev : 5'-GGA GAT GGT GTC AAT TCT CTT GGC-3'
PCR conditions : 30" 94°C, 30" 59°C, 30" 72°C, 38 cycles
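The primers listed above can be checked against the length range given in the description (10 - 40 nucleotides, preferably 15 - 30). The Python sketch below does this for four of the primers and also reports GC content and the reverse complement. The helper names are illustrative, and GC content is shown only as a commonly inspected primer property; the description itself sets no GC criterion.

```python
# Four of the primers from Example 3, copied with their original spacing.
PRIMERS = {
    "Exon 1 For": "ATG ACA ACT CTT GTT CCT GCA ACC CTC TCC",
    "Exon 1 Rev": "TGG CTA GTG TTT GCA TGC ACC",
    "Exon 8 For": "GTG TCC TAC GTG AAG GCA AT",
    "Exon 8 Rev": "TCC AAG CGT TGG CGC CTC T",
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq: str) -> str:
    """Remove the spacing used in the listing and normalise to upper case."""
    return seq.replace(" ", "").upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C nucleotides in the primer."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def reverse_complement(seq: str) -> str:
    """Reverse complement, i.e. the strand the primer anneals to, read 5' to 3'."""
    return clean(seq).translate(COMPLEMENT)[::-1]


def in_range(seq: str, lo: int, hi: int) -> bool:
    """True if the primer length lies within [lo, hi] nucleotides."""
    return lo <= len(clean(seq)) <= hi


for name, seq in PRIMERS.items():
    print(
        f"{name}: {len(clean(seq))} nt, GC {gc_content(seq):.0%}, "
        f"10-40 nt: {in_range(seq, 10, 40)}, 15-30 nt: {in_range(seq, 15, 30)}"
    )
```

All four primers fall within both the general and the preferred length range. The exon 1 forward primer is also identical to the first ten codons of the coding sequence in SEQ ID NO. 1, beginning with the ATG start codon.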

Claims

1. An isolated human nucleic acid sequence encoding a human ataxia protein having
essentially the amino acid sequence of SEQ ID NO: 2. 

2. A nucleic acid sequence according to claim 1 having the cDNA sequence according 
to SEQ ID NO: 1 or sequences substantially complementary to said cDNA sequence 
or fragments that are substantially complementary to fragments of said cDNA 
sequence. 

3. A nucleic acid sequence according to claim 1 which is a genomic sequence. 

4. A nucleic acid sequence according to claims 1 - 3 comprising the SEQ ID NO: 3. 

5. A nucleic acid sequence according to claims 1 - 4 comprising the SEQ ID NO: 5. 

6. A nucleic acid sequence according to claim 3 and comprising the SEQ ID NO: 6. 

7. A recombinant host cell containing a nucleic acid sequence according to any of 
claims 1 - 6. 

8. A vector comprising a nucleic acid sequence according to any of claims 1 - 6. 

9. A protein encoded by any of the sequences according to claim 1 - 6 or a functional 
fragment or variant thereof. 

10. A protein having the amino acid sequence selected from the group consisting of a)
the amino acid sequence shown in SEQ ID NO. 2; b) a portion of the amino acid 
sequence shown in SEQ ID NO. 2; c) amino acid sequences which are homologous 
variants of the amino acid sequence shown in SEQ ID NO. 2; or d) portions of 
amino acid sequences which are homologous variants of the amino acid sequence 
shown in SEQ ID NO. 2. 

11. A recombinant nucleic acid sequence comprising a genomic or cDNA sequence that
encodes a protein according to claim 10.

12. A recombinant nucleic acid sequence according to claim 11 wherein said genomic
DNA or cDNA sequence is operatively linked to an expression control sequence in
said nucleic acid molecule.

13. A method of detecting a genetic defect in the gene responsible for ataxia wherein 
the gene comprises a nucleic acid sequence according to claim 1, said method
comprising the steps of a) amplifying one or more fragments of said gene and the 
ataxia gene known to be normal by the polymerase chain reaction (PCR) with the 
same or substantially the same PCR primers; b) determining whether said ataxia 
gene contains any mutations or genetic defects by comparing the PCR products of 
the amplification of said gene with those from the amplification of the normal gene, 
and c) detecting differences between the PCR products associated with mutations. 

14. A method according to claim 13, wherein the gene comprises a nucleic acid 
sequence according to SEQ ID No. 3 (exon 8) or fragments thereof. 

15. A method according to claim 13 or 14, wherein the gene comprises a nucleic acid 
sequence according to SEQ ID No. 5 (exon 9) or fragments thereof. 

16. A method for identifying or screening candidates for pharmaceutical agents 
useful for the treatment of disorders relating to mutations in the ataxia gene 
comprising providing a test system containing a transformed host cell according to 
claim 7 and determining variations in the phenotype of said cells or variations in 
the expression products of said cells after contacting said cells with said candidate 
pharmaceutical agents. 
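
Steps b) and c) of claim 13 compare the PCR products of the tested gene with those of the normal gene. The claims do not prescribe how that comparison is made; the sketch below assumes, purely for illustration, that both products have been sequenced and are already aligned to the same length. The function name `find_differences` and the single-base change in `test_fragment` are hypothetical. The normal fragment is the first nine codons of the exon 8 coding region (Val Ser Tyr Val Lys Ala Ile Asp Ile).

```python
def find_differences(normal: str, test: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, normal base, test base) for every mismatch.

    Both sequences are assumed to be aligned and of equal length.
    """
    normal, test = normal.lower(), test.lower()
    if len(normal) != len(test):
        raise ValueError("sequences must have the same length; align them first")
    return [
        (i + 1, a, b)
        for i, (a, b) in enumerate(zip(normal, test))
        if a != b
    ]


# First nine codons of the exon 8 coding region (normal gene).
normal_fragment = "gtgtcctacgtgaaggcaatcgacatc"
# Hypothetical test sequence with one substitution, for illustration only.
test_fragment = "gtgtcctacgtgaaggcaaacgacatc"

for position, normal_base, test_base in find_differences(normal_fragment, test_fragment):
    print(f"position {position}: {normal_base} -> {test_base}")
```

An empty result would indicate that no difference was found between the two sequenced fragments over the compared region.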
SEQUENCE LISTING 

<110> Rappold Dr., Gudrun 

<120> Ataxia gene 

<130> ataxia_ba 

<140> 
<141> 

<160> 6 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 1600 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Description of Artificial Sequence: cDNA 

<220> 
<221> CDS 
<222> (1)..(1251) 

<400> 1 

atg aca act ctt gtt cct gca acc ctc tcc ttc ctt ctt ctc tgg acc 48 
Met Thr Thr Leu Val Pro Ala Thr Leu Ser Phe Leu Leu Leu Trp Thr 
1 5 10 15 

ctg cca ggg cag gtc ctc ctc agg gtg gcc ttg gca aaa gag gaa gtc 96 
Leu Pro Gly Gln Val Leu Leu Arg Val Ala Leu Ala Lys Glu Glu Val 
20 25 30 

aaa tct gga acc aag ggg tcc cag ccc atg tcc ccc tct gat ttc cta 144 
Lys Ser Gly Thr Lys Gly Ser Gln Pro Met Ser Pro Ser Asp Phe Leu 
35 40 45 

gac aaa ctt atg ggg cga aca tct gga tat gat gcc agg att cgg ccc 192 
Asp Lys Leu Met Gly Arg Thr Ser Gly Tyr Asp Ala Arg Ile Arg Pro 
50 55 60 

aat ttt aaa ggc cca ccc gtg aac gtg acc tgc aac atc ttc atc aac 240 
Asn Phe Lys Gly Pro Pro Val Asn Val Thr Cys Asn Ile Phe Ile Asn 
65 70 75 80 

agt ttc agc tcc atc acc aag acc aca atg gac tac cgg gtg aat gtc 288 
Ser Phe Ser Ser Ile Thr Lys Thr Thr Met Asp Tyr Arg Val Asn Val 
85 90 95 

ttc ttg cgg caa cag tgg aat gac cca cgc ctg tcc tac cga gaa tat 336 
Phe Leu Arg Gln Gln Trp Asn Asp Pro Arg Leu Ser Tyr Arg Glu Tyr 
100 105 110 

cct gat gac tct ctg gac ctc gat ccc tcc atg ctg gac tct atc tgg 384 
Pro Asp Asp Ser Leu Asp Leu Asp Pro Ser Met Leu Asp Ser Ile Trp 
115 120 125 

aag cca gac ctc ttc ttt gct aat gag aaa ggg gcc aac ttc cat gag 432 
Lys Pro Asp Leu Phe Phe Ala Asn Glu Lys Gly Ala Asn Phe His Glu 
130 135 140 

gtg acc acg gac aac aag tta ctg cgc atc ttc aag aat ggg aat gtg 480 
Val Thr Thr Asp Asn Lys Leu Leu Arg Ile Phe Lys Asn Gly Asn Val 
145 150 155 160 

ctg tac agc atc agg ctg acc ctc att ttg tcc tgc ctg atg gac ctc 528 
Leu Tyr Ser Ile Arg Leu Thr Leu Ile Leu Ser Cys Leu Met Asp Leu 
165 170 175 

aag aac ttc ccc atg gac atc cag acc tgc acg atg cag ctt gag agg 576 
Lys Asn Phe Pro Met Asp Ile Gln Thr Cys Thr Met Gln Leu Glu Arg 
180 185 190 

ttt ggc tac acc atg aaa gac ctc gtg ttt gag tgg ctg gaa gat gct 624 
Phe Gly Tyr Thr Met Lys Asp Leu Val Phe Glu Trp Leu Glu Asp Ala 
195 200 205 

cct gct gtc caa gtg gct gag ggg ctg act ctg ccc cag ttt atc ttg 672 
Pro Ala Val Gln Val Ala Glu Gly Leu Thr Leu Pro Gln Phe Ile Leu 
210 215 220 

cgg gat gag aag gat cta ggc tgt tgt acc aag cac tac aac aca ggg 720 
Arg Asp Glu Lys Asp Leu Gly Cys Cys Thr Lys His Tyr Asn Thr Gly 
225 230 235 240 

aaa ttc acc tgc atc gag gta aag ttt cac ctg gaa cgg cag atg ggc 768 
Lys Phe Thr Cys Ile Glu Val Lys Phe His Leu Glu Arg Gln Met Gly 
245 250 255 

tac tat ctg att cag atg tac atc ccc agc cta ctc atc gtc atc ctg 816 
Tyr Tyr Leu Ile Gln Met Tyr Ile Pro Ser Leu Leu Ile Val Ile Leu 
260 265 270 

tcc tgg gtc tcc ttc tgg atc aac atg gat gct gcc cct gcc cgt gtg 864 
Ser Trp Val Ser Phe Trp Ile Asn Met Asp Ala Ala Pro Ala Arg Val 
275 280 285 

ggc ctg ggc atc acc acc gtg ctc acc atg acc acc cag agc tct ggc 912 
Gly Leu Gly Ile Thr Thr Val Leu Thr Met Thr Thr Gln Ser Ser Gly 
290 295 300 

tcc cgg gcc tct ttg cct aag gtg tcc tac gtg aag gca atc gac atc 960 
Ser Arg Ala Ser Leu Pro Lys Val Ser Tyr Val Lys Ala Ile Asp Ile 
305 310 315 320 

tgg atg gct gtg tgt ctg ctc ttt gtg ttc gct gcc ttg ctg gag tat 1008 
Trp Met Ala Val Cys Leu Leu Phe Val Phe Ala Ala Leu Leu Glu Tyr 
325 330 335 

gct gcc ata aat ttt gtt tct cgt cag cat aaa gaa ttc ata cga ctt 1056 
Ala Ala Ile Asn Phe Val Ser Arg Gln His Lys Glu Phe Ile Arg Leu 
340 345 350 

cga aga agg cag agg cgc caa cgc ttg gag gaa gat atc atc caa gaa 1104 
Arg Arg Arg Gln Arg Arg Gln Arg Leu Glu Glu Asp Ile Ile Gln Glu 
355 360 365 

agt cgt ttc tat ttc cgt ggc tat ggc ttg ggc cac tgc ctg cag gca 1152 
Ser Arg Phe Tyr Phe Arg Gly Tyr Gly Leu Gly His Cys Leu Gln Ala 
370 375 380 

aga gat gga ggt cca atg gaa ggt tct ggc att tat agt ccc caa cct 1200 
Arg Asp Gly Gly Pro Met Glu Gly Ser Gly Ile Tyr Ser Pro Gln Pro 
385 390 395 400 

cca gcc cct ctt cta agg gaa gga gaa acc acg cgg aaa ctc tac gtg 1248 
Pro Ala Pro Leu Leu Arg Glu Gly Glu Thr Thr Arg Lys Leu Tyr Val 
405 410 415 

gac tgagccaaga gaattgacac catctcccaa gggcgaattc cctttcactt 1301 
Asp 

tcctcatctt caatatcttc tactgggttg tctataaagt gctacggtca gaagatatcc 1361 
accaggctct gtgaataggg tgggagctat agagtcctgc tgctggcctc ctgcttcctc 1421 
tgggtgggct ttctccctca gttagactcc attaggggtt tggacagttc cttcctgatc 1481 
tcccactcag aacttcaact accagtccca aagctatgtg ggcctatatt gcatggtgcc 1541 
aatggtggct gtacttataa agatggctta tctaccccta aaaaaaaaac aaaaaaaag 1600 

<210> 2 
<211> 417 
<212> PRT 

<213> Artificial Sequence 
<400> 2 

Met Thr Thr Leu Val Pro Ala Thr Leu Ser Phe Leu Leu Leu Trp Thr 
1 5 10 15 

Leu Pro Gly Gln Val Leu Leu Arg Val Ala Leu Ala Lys Glu Glu Val 
20 25 30 

Lys Ser Gly Thr Lys Gly Ser Gln Pro Met Ser Pro Ser Asp Phe Leu 
35 40 45 

Asp Lys Leu Met Gly Arg Thr Ser Gly Tyr Asp Ala Arg Ile Arg Pro 
50 55 60 

Asn Phe Lys Gly Pro Pro Val Asn Val Thr Cys Asn Ile Phe Ile Asn 
65 70 75 80 

Ser Phe Ser Ser Ile Thr Lys Thr Thr Met Asp Tyr Arg Val Asn Val 
85 90 95 

Phe Leu Arg Gln Gln Trp Asn Asp Pro Arg Leu Ser Tyr Arg Glu Tyr 
100 105 110 

Pro Asp Asp Ser Leu Asp Leu Asp Pro Ser Met Leu Asp Ser Ile Trp 
115 120 125 

Lys Pro Asp Leu Phe Phe Ala Asn Glu Lys Gly Ala Asn Phe His Glu 
130 135 140 

Val Thr Thr Asp Asn Lys Leu Leu Arg Ile Phe Lys Asn Gly Asn Val 
145 150 155 160 

Leu Tyr Ser Ile Arg Leu Thr Leu Ile Leu Ser Cys Leu Met Asp Leu 
165 170 175 

Lys Asn Phe Pro Met Asp Ile Gln Thr Cys Thr Met Gln Leu Glu Arg 
180 185 190 

Phe Gly Tyr Thr Met Lys Asp Leu Val Phe Glu Trp Leu Glu Asp Ala 
195 200 205 

Pro Ala Val Gln Val Ala Glu Gly Leu Thr Leu Pro Gln Phe Ile Leu 
210 215 220 

Arg Asp Glu Lys Asp Leu Gly Cys Cys Thr Lys His Tyr Asn Thr Gly 
225 230 235 240 

Lys Phe Thr Cys Ile Glu Val Lys Phe His Leu Glu Arg Gln Met Gly 
245 250 255 

Tyr Tyr Leu Ile Gln Met Tyr Ile Pro Ser Leu Leu Ile Val Ile Leu 
260 265 270 

Ser Trp Val Ser Phe Trp Ile Asn Met Asp Ala Ala Pro Ala Arg Val 
275 280 285 

Gly Leu Gly Ile Thr Thr Val Leu Thr Met Thr Thr Gln Ser Ser Gly 
290 295 300 

Ser Arg Ala Ser Leu Pro Lys Val Ser Tyr Val Lys Ala Ile Asp Ile 
305 310 315 320 

Trp Met Ala Val Cys Leu Leu Phe Val Phe Ala Ala Leu Leu Glu Tyr 
325 330 335 

Ala Ala Ile Asn Phe Val Ser Arg Gln His Lys Glu Phe Ile Arg Leu 
340 345 350 

Arg Arg Arg Gln Arg Arg Gln Arg Leu Glu Glu Asp Ile Ile Gln Glu 
355 360 365 

Ser Arg Phe Tyr Phe Arg Gly Tyr Gly Leu Gly His Cys Leu Gln Ala 
370 375 380 

Arg Asp Gly Gly Pro Met Glu Gly Ser Gly Ile Tyr Ser Pro Gln Pro 
385 390 395 400 

Pro Ala Pro Leu Leu Arg Glu Gly Glu Thr Thr Arg Lys Leu Tyr Val 
405 410 415 

Asp 



<210> 3 
<211> 441 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Description of Artificial Sequence: exon 8 of 
ataxia gene with neighbouring genomic sequences 

<220> 
<221> CDS 
<222> (173)..(322) 

<400> 3 
tccatctggg ctttcagaca atgggatatg tcatggaagg cttctttaaa caccagaaga 60 
aattcaggat aaagctcaaa aagagcaggc aatcgatagg ggttgaaaat ccactcagta 120 
ggccacggaa ggacttcaag aaggttgatc gttctgtcgc tggatgttgt ag gtg tcc 178 
Val Ser 
1 

tac gtg aag gca atc gac atc tgg atg gct gtg tgt ctg ctc ttt gtg 226 
Tyr Val Lys Ala Ile Asp Ile Trp Met Ala Val Cys Leu Leu Phe Val 
5 10 15 

ttc gct gcc ttg ctg gag tat gct gcc ata aat ttt gtt tct cgt cag 274 
Phe Ala Ala Leu Leu Glu Tyr Ala Ala Ile Asn Phe Val Ser Arg Gln 
20 25 30 

cat aaa gaa ttc ata cga ctt cga aga agg cag agg cgc caa cgc ttg 322 
His Lys Glu Phe Ile Arg Leu Arg Arg Arg Gln Arg Arg Gln Arg Leu 
35 40 45 50 

gtgaggtaca actagggagc ggcccactgc ctttctcctt tggactcctt cctccctagc 382 
tccgcctccc cacaaagccg agacccaccc cagggctgca agggactaga atagggcaa 441 

<210> 4 
<211> 50 
<212> PRT 

<213> Artificial Sequence 
<400> 4 

Val Ser Tyr Val Lys Ala Ile Asp Ile Trp Met Ala Val Cys Leu Leu 
1 5 10 15 

Phe Val Phe Ala Ala Leu Leu Glu Tyr Ala Ala Ile Asn Phe Val Ser 
20 25 30 

Arg Gln His Lys Glu Phe Ile Arg Leu Arg Arg Arg Gln Arg Arg Gln 
35 40 45 

Arg Leu 
50 



<210> 5 
<211> 534 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Description of Artificial Sequence: exon 9 of 
ataxia gene with neighbouring genomic sequences 

<220> 
<221> conflict 
<222> (319)..(488) 
<223> exon 9 

<400> 5 
gagatgtcct ctcacagtgg ccggcacttc actggatttt agataactct aaatgatgtt 60 
ttgcatttta catgggtagc ttagagccta tctgccatgg aatcatgaca tacactatat 120 
atagcacgta gagctgctta gcatacatct atctcatgag tagtgttgtg accataaaaa 180 
gcatgcaatg ggcctgtgta cccaaggcac cttgtctgca taacattccc ctatgagata 240 
ttgatgggtg gtctgttctc tatccctctg cctttgagct ttgacctcaa attctgccgt 300 
tctcctgtaa tttcccagga ggaagatatc atccaagaaa gtcgtttcta tttccgtggc 360 
tatggcttgg gccactgcct gcaggcaaga gatggaggcc aatggaaggt tctggcattt 420 
atagtcccca acctccagcc cctcttctaa gggaaggaga aaccacgcgg aaactctacg 480 
tggactgagc caagagaatt gacaccatct cccaagggcg aattcccttt cact 534 

<210> 6 
<211> 39796 
<212> DNA 
<213> human 

<400> 6 
gatctctttc actgtcataa ttttatcact gttaaaattt ttgcaaaggc agtttcaaag 60 
aggtcctctc tactgaggac ctctttagaa gtctttgtgg gatgggtaat gtccctcttc 120 
cagactcaag ccctcagctc caacctcctc tggagtcctc tgagctgacc gctgtggctg 180 
agctgggcca ggggtatgct gcctccctcc actaccagtg tccagttgaa tgcacaagga 240 
ccagatgtgt attcttcaag tgtgagccaa tctggtgcca ctgaatcctg caaagaggca 300 
tgcccaggcc ctggaaatgt tatagtaggt agtcagacat gagcagggca ggagagggct 360 
cccccaaaac taggaatgtc aggtgaccat caggtgatgg tcaggtgatt gttaaactgt 420 
ctctctaaaa taataattgg ttgcagttgg caccagggaa tgtctcccaa tagatagaaa 480 
aacctaaaac tggtgatcag cagcttcctg ataagatctc aggagctggg ctagtgggtt 540 
caagcatgtg cactaagagg caaaatggca gagtttaact ggtatatggc cttcctctag 600 
aacactcaac tgataaggaa agaacgcctc atgtgagcat gcttacaact tcagtaaacg 660 
tactgtgcat acggcccctc ccaaatgctg gcaggccagt gcacatgtgg acagcccacc 720 
ccaagggaag aatcaagaga gaagagatgc aaacccccag aagcgtgcca acatataaaa 780 
ccccaagtcg aaggtcaaac catgcatttg aacctctcaa gtcgcctgct tggtcttctt 840 
acaagtatac gttacatcct tttgttccta cactaaaact tcttcagaaa ctttcactcc 900 
tcctctaaaa cttgcctcag tctttccctt tgccttatga ccctccatcg aattctttct 960 
tctgaggagt cgagaattga ggttgctgca gacccgtaca gatttgccac tagcaggaag 1020 
accagaagtt gcaacagctg caggcagaga gtaggggtga ggtgctcagt gcaaagagaa 1080 
ttctggtggc aaggggatcc ttgatctgtg gtggtgtgtg accactctgg gcagtgatcc 1140 
aaaaaattga attcacacac actcacttct tcctggagca agcagaacaa atgtttcttg 1200 
catcctgctc agctccagcc ctcctcaagc tccaaccacc ccccaccaca ccaaataaaa 1260 
aaaaaagaga tttaaagctc agctattttt aattttttgc ttctccaaaa gttggcctgg 1320 
agctcacagc tatttttcct cttttctcta cactggagtt ttcatcgatt tgaaactctc 1380 
ttaatccttt aggccatggg ctgatgaacc ttctgaggta acctctttgg caaactgcat 1440 
aggtgggtgt cagtggggat catgctcttg [illegible] [illegible] [illegible] 1500 
tgcaagacaa gactccctct ctgacccttt [illegible] [illegible] [illegible] 1560 
tattcctaaa atctttacaa tcttcaaagt aatacttttt [illegible] [illegible] 1620 
tatggctggc agcaggtagg tagcctcagg [illegible] [illegible] [illegible] 1680 
agaattagag gattgtgatg ttcagccaca cccccagcct [illegible] [illegible] 1740 
gaaagttaag ttaatcactg atggccaatg atttaatcaa tcaagcctac ataataaagc 1800 
cttcataaaa acccagaaag acagggtttg aaaagcctgt ggatagccat acatggagtt 1860 
tgaacatctg cccaccagga ggatggcata tcccaactcc acagagacag aagctcctgt 1920 
gctcaggact ttagctctca ccctgtgtat ctcttcatct ggccattcat ttgtatcctt 1980 
taaaatctcc tttctaactg gtaaaattaa gtaagtgctt ccctgagttc [illegible] 2040 
tctagcaaac tgatcaaacc tgcggaaggg [illegible] ccccgattta tagccagtca 2100 
gtcagaagca gaggcaaaat aacctggggc ttgtaattaa cactggaagt [illegible] 2160 
cttggggact gagccctcaa cctgcaggat ctgacgctat ctccaggtag gtagcatcag 2220 
aattgaatcg gaggacatcc agctggtgtc tcctgcagag ttgcttgctt [illegible] 2280 
aaattgggaa acacaccaca catttggtca cagaagtgtt ccatgttgaa [illegible] 2340 
taagagcata agaataacag cttatgtttt ttttctactc aggcatctac ctcttcccca 2400 
aatctagttc cttctgccct ctgtgtggag aagactactg ggccttaagc tcctgaagga 2460 
atcacagcat tggagccagc acaagcagca cttcctagaa atctccaagg cttcagagct 2520 
gatcacactt ggctagggct gagcccaagt tccatggaaa gaaaccttta caaggtgcca 2580 
tggcacctgc cttaaagagg gcaggcaccc tcatcagggc aggcaactaa taggagtccc 2640 
tggcagcttg ggagttggaa acaaatctta gaagacagac tgaaactaat agctaatagt 2700 
ttttgattac ttggtatatg tccagccctg ttctaactgc tttatagata attatgcatt 2760 
gattctgtga aacactctta taaggaaggt actgttatct ccattttata ggcaaggcaa 2820 
ctgaggcaca gagaggttac attggtggta gaatacggaa caacattatg aagctagaca 2880 
atctgtcatc agatcccagg ttcttttttt tttttttctt tttttgagac ggaatctcgc 2940 
tctgtcaccc aggctggagt gcagtggcgc gatctcggct cactgtaacc tctgcctctt 3000 
gggttcaagc aattctcctg cctcagcctc ccaagtagct gggattacag gcatgtgctg 3060 
atcccaggct cctaaccact aggcacacga catgtacatg agtcccctct cctgaatctc 3120 
cacttccaga tttagtcctt cactcaggac ctggttcctg atgcctgcag ggtttttagt 3180 
gttttgctgc agtcgggctc tgggacagtg tagccttagg gcatagagca aggagagagc 3240 
ccctccaatg aatctgcagg agggccagga taggaaagga gggagggcag caaaagcaaa 3300 
tacctggaag cagctagtaa gataacagaa cagcgagaat cctctcctcc ccaggcaata 3360 
cacctgcatc ccccacctct cccaagtaca cagatatgca tgcactcgca tgcacataca 3420 
tctgtgctca attattccag tggaagggca gagcagggct gccagagaag agattgcttg 3480 
tcagtgaagg agtttggcag atgtgccctc cagctctgtt tttgaaagaa gccagagctt 3540 
tgggggctgg gttacctgca gaactccagg agcttagaga gtgcctccag agagggcaga 3600 
tgctgagagg acctcgggct gcgctgcagg ataggctcat tcaaaactct gtgagccaag 3660 
ctttgtggac agcctcctgg actcttcttg atcaagttct aagactccat tctggaacct 3720 
tcctaaggca ggcttagtgg caatttttct cttggccctg attattctgc agaaacccct 3780 
ccatcttctc aatcagtggg ctacctgaaa acagaactca gtcttcatca gttccttagg 3840 
ctttgactcc tctgcataaa aatagtcttg gtgacccctc agttttccaa gaataagctg 3900 
gctaaacatg aaagcaaggt cttcccatag tgcttctctc ccaaaagctc actgtactgt 3960 
gtggttagaa cctgtctacc tcacatcatc agacatctct gcaattggca cctactcttc 4020 
tatttttatt cttactttta caggctcgcg tgtacttatg cgactcagtt cactaaaata 4080 
aggatggtac tgtcctgtat atctcctaga agttttgaaa atctgatgag ataaagtcct 4140 
gtatgccctt gtgaagtaca cgaagtgctc tatgtgctag aaaacacatc catctcaccc 4200 
accagcctgt tagcttcttg tctgagtgtt cccttcctag tgcccaacag agtactttgt 4260 
acatagtaga ctcccaataa aatggctctt cattaattca agatatgact gggaggaagt 4320 
gcagcatttt aggattgcaa atgactatca agcctctgtt actgtgagtt gcttcatggc 4380 
ccacgccttg gactaggccc tacccgtagt ccccaagagc atcatcagat gacacaagtg 4440 
aagccaactt cacaactttt tctttagaag ggtacactgg gagtagggct atgctgggct 4500 
atgaggagat tgcgttaagg ccctgtctct cccctcatta ataacaagga ggtgggaatg 4560 
agagcaatac ctgggaccaa cactgtgatg gtagaaggtg agggaagaca gaagaaggca 4620 
tggggacttt gggactctag cttcttgttc attccaaact atttcagatc cattcgatgt 4680 
aagcagaatg aagggagaag gctgagtggg accatctccc tctagcagca ggcagacagc 4740 
ctaagactca gagaggggaa atgatcacag ccccatagac ctgcaggagc cctttgaggc 4800 
cacctccttc agtttatagt tgaggcccag aaacctgaag taattttatc caaggtcaag 4860 
aaccagggaa ggacacagcc atgactggaa cccaggcagc ccagctcata gatcaggctt 4920 
tgattcccta ccctcacaca aagggaaatg gttctttatt attccatatg ccttaaccag 4980 
tttgaccttc acagcaagcc cagtagtcag gcaaagggcc atcattccca ttttatacca 5040 
gaggaatctg gggcccaaac agattttttc actaatagta catattgggc caggtgtggt 5100 
ggctcacgcc tgtaatccca gcactttggg aggctgaggc gggtggatca cctgaggtca 5160 
ggaatttgag accagcctga ccaacatgga gaaaccccat ctctactaaa aatacaaaat 5220 
tacctgggcg tgatggtgca tgcctgtaat cccagctact cgggaggctg aggcaggaga 5280 
atcacttgaa cccaggagat ggagattgca gtgagctgag attgcaccat tgcactccag 5340 
cctgggcaac aagagggaaa ctccgtctca aaaaataaat aaataaataa ataaataaaa 5400 
tagcatatat tggtgacatg gccaggaaca ttaactagtg ctcttaacag agcagctgtt 5460 
aagggcatta cagctagtca gccaaacctg ggtttgaatc ttggctcaac attattactg 5520 
tctgtgagac cctgagggaa gttgcttatc ctctttgggg ctcagtattc tcaactataa 5580 
actagggaac gataatatta ctaacctcac agcctgttgt gagcacgaag taaagcaata 5640 
tatgtagagt tcctgacatg gttggaggca tgtagtgagt tctccataac acagtaattc 5700 
ttcctttctc tctgcaccca ccccaacctc agtgcctctc tcttcaggag tccaggagga 5760 
tggggagggg cagttggtaa tagagttgaa gaagtatgct ttgggacaca taaccagcaa 5820 
tattccctta tcacctggcc ctagctcctc ctgcctccct ccctcatccc accaaggaca 5880 
ttggacgaaa gagtatccag agcgcactgg tactccattt gtgttcctga gctggcagca 5940 
tgcaaaacct ttaaatacaa attttaaatc agcaaatccc aattgcctga tcttggaggc 6000 
aaaaaagaca gatagacaac agatgtagag aaaggagcca ggaggaccaa ataccccaca 6060 
gcttggggtt gggggaggag gtagagataa agacgaaatg aaggcgagga gacaagtgca 6120 
cacaaatagg gatgcttacc atgtggccga attccttttc ccctaggcag acagtcaagc 6180 
agctttgtgg cctggtagga ccaggcctcc ttgtttctcg gtattgctaa aaccaaggtt 6240 
aatgtcaggt cctgacactg gcccaatgac aatcctagcc agcaggagct tggctcagaa 6300 
gaagtgaaaa aaaaatgata agcaagtgct ctttgcagaa gtctttccct tttatgaccc 6360 
catatatatt tgtctgttcc tttgtcatct ataccatttt cttagtttgt agaaaactgg 6420 
cctgcttctg tcttgggacc actgcaaggg aatgttattc cccagcaata aagtaggtca 6480 
gagccagact ggagacttgc attcctgtgt ggttgtctcc attgtcacca ctcaagtctc 6540 
agggattgct cttcctaaaa ccatgtcctt gggagcaagg ctttgcccat tgttttctct 6600 
tctcttctct tctctagcag aaggtctcta aaggctacta tccctagaac tggccatctt 6660 
caggctgaag ctaggttggg ctaaaatgat cagccttcag tagaaatcaa ggctgaaaca 6720 
ggtaaaaagc aggctgtctg agcatttccc taccagagac ccttcagcta agtgtctggg 6780 
tatactcagt caaacaccat ccctgccttt gtgttcatat tctgtgtccc tattcaagaa 6840 
aggtagtttt ctttctttct ctctttcttt cttttctttc tttccttttc tttctttctt 6900 
ttctctctct ctctttcttc ccgtctttct ttctctctct ttctttcttt cctttctttc 6960 
tttttttttt tttttgagac ggagttttgc tcttgttgcc caggctggag tgcaactgcg 7020 
caatctcggc taaccacaac ctctgcctcc cgggttcaag ccattctccc gcctcagcct 7080 
ccggggtagc tgggattaca ggcatgcacc atcatgcccg gataattctg tattttttag 7140 
tagagacggg atttctccat attggtctgg ctggtctcga attccaaacc tcaggtatcc 7200 
acccgcctca gcctcccaaa gtagtgggat tacaggcgtg agccaccatg cccggcctgg 7260 
cagtttcctt aaaccagctg tcaggtgtct cggccctcag cttcctgagc tatgctgccc 7320 
atttctcaga gcagccaggg gaatctctgg gcaagtgtag tgttgcatct gtcagaagaa 7380 
gaaataaagc tttttaatga tccttgagtg aatagagagg ttagtttcat taaatacagg 7440 
actgccaagc tggcagcctt actgattagg ctccttggat caagttctca gcacaaagat 7500 
ggtggaattg ggattaagag gaacaaaggg agcccacatc ccaatgagaa acattcagga 7560 
tgtggaggcc ccttgctggt ctccaatgag gtgtttttgt ttcacttttg gtgcccttca 7620 
gcaggggcac ctcctccagg tggaccaatt gacatccaag gacacgttta aatgttcagg 7680 
tgccatggac tcgatgtgtt ccttaaaatt catatgctgg aaattaatct ccaaagtgac 7740 
agtgttggaa tatgggcctt ttgggaggct ccaaccacat gaatggatta atgtcaacta 7800 
taaaaagggc ttgtgggagt cggttcactc tcaagtcctt ctgccttatc cgtttcaatg 7860 
ctggggacac agcaaggccc catcttggaa ccagagacca aatctattgg cgccttgatc 7920 
ctgaactctc cagcttctag aactatggaa ataaatttat gttctttata aattacctag 7980 
tctttggttt tctgttatag cagcacaaaa cagactaaga caacagatgt ttagtttctg 8040 
ggctcttagt attccctgct cactaccact tccctctcaa ggaagaagaa gggtgaagca 8100 
ggaaaatgga aacttgcatt tgctgagcaa gtattggttg ccagacactg ctttgggcac 8160 
tttcctatac cttatctcat ttctttcttt ttttttttct ttcagggtaa gggatgtgct 8220 
tcaggtgcaa agaatttcat tgactggaat actgttagag tcatattatc atcatcagtt 8280 
tatgcatggc ccactttgta tcattttatt tctattttta tttattttta ctttatttat 8340 
ttccctcttt atcacccttc ccaatcccat ctcctttcat gatggatatg cactcttttt 8400 
tttgagacag aatttcacgc ttgttgccca ggctgcagtg caatgatgca atcttggctc 8460 
actgcaacct ccgcctccca ggttcaagcg attctcctgt cttagcctcc agagtagctg 8520 
ggattacagg catccaccac catgcatggc taattttttg tatttttaat agagacaggg 8580 
tttcaccatg ttggccaggc tggtctcaaa ctcctgacct caggtgatcc acccgccttg 8640 
gcctcccaaa gtgctgggat tacaggcgtg agccactgca cccggccaga tatgcactct 8700 
tgtgtgtttg atttacataa atggtattgt tctgtacatc ttctctgctt ctttgtttca 8760 
tgtaatatta taattattga gagctaattc tgttgataac tgtgaaagta atatgttcat 8820 
tttaaccact atataatatt ccatgatttg catattttgc attttattta tccattcctc 8880 
tgttgatgga catttaggtt gttttcaatt tctctgtatt acaatattgc aacaaacatt 8940 
tcatgcaagc tactttgtgt acctgtgaga gtagttctag ggggatatag aaatgttggc 9000 
caggcatggt ggctcacgcc tgtaatccca gcactttggg agccgagcgg cggatcacga 9060 
ggtcaggagt tcgaaaccag cctgaccaac atggagaaac cctgtctcta ccaaaaattt 9120 
agtagagacg gggtttctcc acgttggcca ggctcgtgta ggactcttga cgtactgatc 9180 
cgcgtgcctc ggcgtcccaa agtgctggga ttacaggcgt gagccaccgc gcccggcctc 9240 
tccaatattt ctatagttgt atagttttgc ctttcatgtt taagtgtgca gtacctctaa 9300 
tgtttattac gtacgtggca tgacataagc atttaatctt gtttttctat gttataagcc 9360 
aatttctcct attttatgcc tgttattcat agcatggacc tttcctgtgg ctcttgcttc 9420 
atgagcagta aatcaagata cattcaccaa gttcccatcg tattcagagc cctgaagaag 9480 
cgggattcta gtggagttta tgacacagaa cctgccttca ggaaatatgc catcttgctg 9540 
ggaaaataaa cccatggaga gaagaaaagg ttttgagagt atacataaag caatacctaa 9600 
gtgaggataa atagaaacct tggaaactga gaatacatgt ttggagggtc aaaaagggag 9660 
cagggtaccc attcctaggc tgggatggat aaaaggattg aactagaagt acagggacac 9720 
cattggatat tttcactgtg agatgatcct cgaatcaagg tagctttatg cagttgccac 9780 
agaatgaaag aggagatagg agacactgag acttcaccat tggccattat ctggccttgc 9840 
tggtccgctc agagtttcct tcccatgccc ttctcctttc ttattgctcc tcagccccag 9900 
attactaatt aataaggaag caaggtcatt ttttagtgca gttcaggtac ccagtccaaa 9960 
aaatctccct gaatcagctg tctccattct cagagccatg gattaaattg aagggaacta 10020 
atttaaaata ataataataa cacaaaaaca ctaataacaa caggaataat tgtaacataa 10080 
taacctcaca ccctttcagg agctcatatc ttctctatga atatgatcta ccttgctcac 10140 
cagtgggaag agaggaagct gattgaagga gattttgtgg agcgggtgcc tggactctgg 10200 
tagaaaggga tcttggggtg tggcagacac aagacagaag gtcaggactt agctgccttt 10260 
ctcagcctaa actggctttt agtttgcctg tgactttcct atttggccag aaaggaaaaa 10320 
gtgtggaggc ttcagagagc ttcaacgccc cctcctcccc tagcaggccc cctctgcctc 10380 
aggatgctgt gccatgccac aaaggtggta catcttgcct tccaaacttg gctctcccag 10440 
agtctacatt acaaagtgta caccaattac cctcagcagg gcagcctgga tttagtgtag 10500 
cccacgtcag gagctgattt tgctcctagt aatctccttc tgctttagtg gtttgagact 10560 
ctggaggggt aggtgggagg ggaacagggg ggcaggtggg tggaggaaga aaagcaaaca 10620 
aaactcatca aaactaacaa acaaaaaaac tccagctctc tgctcattaa tcctctcagg 10680 
gcctacttct taccatggca accagtaagc ctccaggcct cctcagtcct cactccagtt 10740 
gttagtaaag agaggaggct ggctgctttg cctgggatga tttcaatact ggacaaatta 10800 
aactggtcct ggggagctgt ggaagttgaa ggaggaaagc ggtagagaga ggaaagggaa 10860 
tgctgaagtg aggaatggca agtttccatt aggcagcatg tctcaggatc tctcttggga 10920 
aggtggtaag ggcctccaat gccaacttgt gttaaccctc aacggtcttg atgaaggcca 10980 
accctgccca ctctcttccc cattccccaa aggcaagatg atgcagtgtg tgggggtgtc 11040 
agtcagctct ttgcagcaca aagtgatttg ctccgtcttg aaccaaaggc aagaagagga 11100 
gacagaaagt caaagtcagc agggtactta ctttcacttc acccagaagg cagctggtca 11160 
ttggtgagag tgcaactctc taaagaaaca gaagacacat tcctcaggaa aggttcaggg 11220 
caacttagct tctctgacct ccaactctgc ctcaccaagc ctcccaaagg caccattgcc 11280 
acagccctca tttggttcaa tctgatcaca ggtcactgag ccccatggat ggataaccat 11340 
ctgccaaggt gggcagtttt tcaagttata tcagaggctg tgtaactccc actggcttct 11400 
ttcctaagaa actgaggctg gagccagatt ccttgaccca caatgaactg ggacttgact 11460 
atgatatcac tcctgaggct agtttccagg caattcacat tggtcctaca acacggaggt 11520 
atgaatgaat tcttaggtcc tttcttaaat tcctgctttg acaaggaaaa ggcagaatgt 11580 
tggaaaggaa ggactcaggt gaggcccctt ggtgtttttg tcctgtccac agctagtgag 11640 
tggttatttg ttcattttgt ttccttattt ggtttggttt ttcctattac aaagggaagt 11700 
agtactactt tctccctaaa aacactatct ttcttcctct gtatgtcctc ctgtgcctga 11760 
tactatctgc acactttccc attcctttag tgagtacctg agtcccagag tctaaaaaaa 11820 
aatcatctcc gcttcagcct cagcctccaa gaccttaata gcttaggcat attaatttct 11880 
gagccagaat agagtcttgc tttcacttac tcccaaatgc ttgttgaaag tcaggtaatt 11940 
tggctaccac tgtatctcta gtattccttg gcacatggtt ggcattcagt aaagatcctt 12000 
tttttttttt ttttttttga gacagggtct gactgtgtcg ctcaggctgg ggagtgcagt 12060 
ggtgcaatca tggcttactg cagccttgac cttctctgct caagtgatcc tcccacctca 12120 
gcctccgaag tagttaggtg ggactgcaga tgctcacccc cacacctggc taattttttt 12180 
aatttttttg tacagatggg gtctcattat attgcccaga ctatagtgaa gaaattttaa 12240 
acgaatctta gtatatgtat aatctcagag tttttcgctc acttaaaata acagcagcat 12300 
ctaaagcttc catttattga aaagctactg tgtcagacag agcctgtgct atggactaca 12360 
tatggacatt ctaatttaaa acttaggaca tttttgaggg acagaaactg ttattaattc 12420 
tgttccacag atgacaatca tgaggctcaa gaatgttgat taattttccc acatagttaa 12480 
taaatggcag aacctagtac caaactcaga taccttggac tccaaagatt gcctctctat 12540 
gcctctggtg ataggaaagt gccattgaga ctaagatcac tatttggctt tctgacatgc 12600 
cagttaggtt tgcacagaaa cagttccatg gctacatact ttgttctcaa ccctagcaaa 12660 
tcagtttatc caaaggggct caggtgagag tgtgctaagt agagagaagt tgccaagtac 12720 
ccttttctac ttgtaaatat cccttacata tgtacaaagt ttccgtttaa aaaaatactt 12780 
ttttatttcc tacagtcttc acagcaagca tgagagagtt ttgttacagc tactcagttt 12840 
ttaaataagg acactgaaga tcagggaagt gttataaatt gtctaaggtc acacagccaa 12900 
gaagtagtag aacaggggct aaaatccact cttcttattc tagagcacca gagtataagc 12960 
cactggatcc ttttggcaca taaagcatta gaaaccttgc tttgtgtcag atgccaactc 13020 
ccacaaaaag attactgctt gctggtggcc tatgcctgta ttcctggcta cctaggaggc 13080 
tgaggggagg atatcttgag cccaggaatt tgagatcagc cagagcaaca cagaaagatt 13140 
ccatctaaaa agactgctgc attttattca gacaaggggg aactagagcc ccagttctat 13200 
gtcccaaatg gctaactgac aattgagaag aaggcttgca tctcatatgt tctgtcagaa 13260 
ttcgaaagac atcatcagga aaaccttttt atatcttctg aaagttgctg tctaagtatt 13320 
agcaaccccc ctgccacccc acaacaggca ttgaatgagg actgggaatg cacatcactc 13380 
ccctccatct gtcctctttc ccctttgagc agtcttaatc attttgaaat ttcaatgaat 13440 
taagaaaaca ggccatcacc cctagatata acaacacctc tctttggaag gaggcattgg 13500 
tcataaagag atcactgctt tatggggcaa aagccatcag attactgggg attttgcacc 13560 
agcctaagta ggcttaactc ctaaatggca aagaggcttt gacaaggatc cctcagaatt 13620 
ctgctagtga ttcatgtggg gctttcctcc tcctcacctc tctctacttc tgtcacccca 13680 
gcccccactc ctcccataat aagcagtcct tccctgcctc tcccactgtc aagggactct 13740 
tctgtgaaag aataacattc ccagccgggc aagatggctc aagcctataa tcccagcact 13800 
ttcggaggcc gagacgagtg gatcacctga ggtcaggagt tcgagaccag cctggcccac 13860 
atggtgaaac cccgtctcta ctaaaaatac aaaaattagc agggcgtggt ggtgggtgcc 13920 
tgtagtccca gctactcggg aggctgaggc aggagaatca cttgaatccg tgaggtggag 13980 
gttgcagtga gctgagatcg tgccactgca ctccagcctg ggtgacagag cgagactcca 14040 
tctcaaaaaa aaaaaaaaaa aaaaaaggca gggcgtggtg gctcatgcct gtaatcccac 14100 
cagtttggga ggccgaggca ggcggaccac ctgaggtcag gagttcgaga acagcctgac 14160 
caacttggag aaacccagcc tctactaaag atacaaaatt agccgggcat ggaggctcat 14220 
gcctgtaatc ccagctactc gggaggctga ggcaagagaa tcacttgaac ctgggagcag 14280 
agactgcggc gagccaagat tgcgccatta cactccagcc tgggcagcaa gagtaaaact 14340 
ctgtctcaaa aaaaaaaaaa aaaaaaaaag aataacattc cttcttcact gaattattca 14400 
tctttgctgc cttggtatgg caaaacttct cattaaaagc cctgggtgag gccgggcgcg 14460 
gtggctcatg cctgtaatcc cagcactttg ggagcccaag gcaggcggat cacaaggtca 14520 
ggagtttgag accagcctgg ccaatatggt gaaaccccgt ctctactaaa aatacaaaaa 14580 
ttagcaggag gtgatggcag gcgcctgtag tcccagctac tcgaaaggct gaggcaggat 14640 
aatcacttga ctccgggagg cggaggttgc agtgagttca ggccactgca ctccagcctg 14700 
agcaacagag caagactctg tctcaaaaaa aaaaaaaaaa aaagccctca gtgggagtga 14760 
cctacttagc gcagcactca caattagaca gagagaagac aaggggagga agagatgcag 14820 
tggcaggggt gggaggctac cccaaagggg ttgtgacagg gagtccccaa aacccaggga 14880 
cttgagagac tgaatggcac ctctggaggc tcctggctgc ggggatgacc acggtgcgct 14940 
gcttaccttt cattcttcat tcacttgtac acatgagagg cggcacacat gctgctgcca 15000 
tgaagaccat gggctctggg gccaggctgc tgggtttaca tcccacatat cttccgagat 15060 
atgtgaccct gaaaaaccat ttaaactagc catggtggtt catgcctgta atcccaacac 15120 
tttggaaggc agaggcagga ggatcacttg agcccaggag ttctagagca gcctgggcaa 15180 
catagtgaca ccccagccct aaaagaaaac aaaacaaaaa atttaattag ccgggtgtgg 15240 
tggcaagtgc ctgtagtctc agctactcgg gaggctgagg tgggaggatc gcttgagcca 15300 
ggaaggttga gactgcagtg agctgtgatc acaccacggc actccagcct gggcgacaga 15360 
atgagacact atctcaaaaa aataataaaa taaaagtaag aggaagagac tagggtgcac 15420 
gcgctctctc tctcttacag aaagaatggg ggcctggagt ggcaaaacag gttaatgaga 15480 
gaaagaagag gaggcctggc cagcaaatgc agtcttgtta tgtaaatgaa acctcacaag 15540 
tagcagctct cacagagaat aggtggtaaa tgttcctttc agacctttaa agttgtcaga 15600 
atctcagtta atcgttccta gatcccaaaa aggaaaggcc tttggaaagt ctggctgcag 15660 
cgatgcaggt tttctctaca aatgcaaatc tcccctacag aagacagctt tgcaaggcta 15720 
cttgagtttg caggccctct gagcagtcat ttcaaaatat gtcaaaaaat tatattttgg 15780 
ggtaaaatat tttgatttcc ttcactagaa agagggccct cagcagatcc tgaccatctt 15840 
ggcaccctgg tcttggattt ccaacctcca gaactgtgag aactaaattt cgcttatttt 15900 
ttagtcaccc agtcagtggt atttttttat ggcctcccta gctgactaag acaatgatgc 15960 
tgcaagaaac cttgccacca gagcagctaa gatgagcccc catggagcaa ctgacactca 16020 
gcactgttgt ccaagctagg gcaggcacca gtgcctcctc atccattctt gtatttattc 16080 
attcatgtat tcatttacaa atgtttagct ggcacctacc ctgtgcctgg ccctaagcta 16140 
ggtgctgtag ggtgcacagc aatgagaaaa tcaggcccag gcctcaccct cgtggagctt 16200 
agaagctact aaggaggagc atctctgaag aatcacacac atgaatgtga gacacaatag 16260 
agttttttaa aaacccaaga tgttgtgagc accaataaga ttgggtaagg ggccaggaaa 16320 
aaactccaag gcagctgggt gaggtcgctc atgcctgtaa tcccagcatt ttgggagcct 16380 
gaggtgggtg gatcacatga ggtcaggagt tggagaccag cctggccaac atggcaaaac 16440 
cccgtctcta caaaaataca aaaatttgct aggcctggtg gcaggcgcct ataaacccag 16500 
ctactcagga gcctgaggca gaattgcttg agcccgggag gcagaggttg cagtgagcca 16560 
aggtgcacca ctgcactcca gcctgggtga cagaaggaga ctccatgtca aaaaacacat 16620 
aaataaataa gaacgccaaa gagatgatgt ttaaatctgg acactgagaa agaaagagaa 16680 
ggcatcaaac aggcaaagag tgaaggagaa gttggtgcag gtggaagggc aaggtgagtt 16740 
cctttccctg gcccaggtca tttccaaacg gttttgtctt agaggagtga ggctctctga 16800 
ggtctggatg cttattggct tccacttgtc aaactctgac tgccaccaca tgggctttcc 16860 
caaagcctag cccacatacg acgtgtatca gagtcaccag gaaaacttga taagatacag 16920 
atacctaggc tgcactcgcc tgagagcatt ttagtgagca gtcatagtgg ggcccaagaa 16980 
tggtggggcc caagcagtaa tgatggggcc caagaatgca ttttaacaag gaccccaaga 17040 
ggttgctcaa gcttggagaa tttctttttg ttttttgttt tttttttgag gcagagtctc 17100 
gtctgtcacc caggctggag tgcagtggtg caattttggc tcactgcaac ctctgcccca 17160 
ctgggttcaa gtgattctcc tgcctcagcc tcccgagtag ctgggactac aggcatgcac 17220 
caccacgccc agctaatttt tgtattttta gtagggatgg ggtttcacca tgttggccag 17280 
gctggtctca aactcctgac cttaggtgat ccacctgcct tggcctcccg aagtgctggg 17340 
actgcaggca tgacccaccg tgcctggccg ggaatctctt gacatactac atagtatgag 17400 
gaaaatgggt tggtggcaaa aactgagcta gtaaaccacc gcccatgttt acaagcctgg 17460 
gtttaatagt gactttctct caaaagctat ctctggtctc tgcatgtcgt tttttctaat 17520
atcaggctcg gaagagtgta ggctctcttt agcttggagt ttcttttttt tttttttttt 17580 
ttaagacaga gtttcactcc tgttgcccag gctggagtgc aatggtgcaa tctcggctca 17640 
ccacaacctc cgcctcccag gttcaagaga ttcttctgcc tcagcctccc aagtagctgg 17700 
gattactgcc atgcgccacc acgtccggct aattgtgttt gttttgtttg tttgtatttt 17760 
tagtagagaa ggggtttctc catgttggtc aggctggtct caaactcctg acctcaggtg 17820 
atccacccgc ctcggcctcc caaagtgctg ggattatagg cttaagccac cgcgcccagc 17880 
cagcttggaa tttctaagtg gattttttct taaggagacc aaccctcctg gtttgcctgg 17940
tactctccca gttttagcat taaaaattct gtgtcccagg aaatcctcag tcttgatatt 18000 
aattggatcc ctaatatctt tggcaaatga ggatgactgg ttaacctagt gtcctagaga 18060 
agtgcctagc tggattcacc tagaaaaggg aggagtcact aaaagtctcc ccagtctccc 18120 
tactgccaag gtgctcccac ctgccaaact tgccttctca tccatgtgac aagaggtaca 18180 
cagaagagtc ttctccctcc aactccccag catgcatgct ttctcctgtt tgtttctgga 18240
cagtgtcata tcaaaaagga gacatgacaa taagccagta ggtcccacag cataaccaat 18300 
gtgtttgaga tgggaatgat caaagcttta ccaagggcta agccctggtc ttagaaactg 18360 
gagcagaaag aaactatcag ctaacatatt cacagggtaa tagaaaagat gacatatagc 18420 
atcgccatgg caaccagcct ccaaaggccc ccagcacctg ctccattcta cacaatattt 18480 
ccccaaacag ggatatttct tacttccttg tttttctttt caataaagat gttaatttta 18540
ttccccctca tttgtgctta atgttccagt agcaatttag ctccttctgg gctttcctag 18600 
cccctccctg tctctcccaa accttttctt ctcctgtctc ccatccttcc tttcccttcg 18660 
atctcttttt tttctctgtc cccacaattc ctgtccctgc ttccctagac aaatatttcc 18720 
aaccacctct cttctcccag agctaagccc tcttccagga tattttatca gctgcggttc 18780 
tgatggccta gtaagtgttg gtctttgcat tgggtgagtt ctctatggtg ctctcttgaa 18840
atcccacgcc ctctcttctg gtgtctcacc acaccaccac catcccatct ccctgagcta 18900 
ggggacacag tttagtaaaa ccattttagc ccacagagca gatgaaggga aacaaaggct 18960 
gcaaggtgga gaatagtaac agttggggtt ttttgagtgt tgatcatatg cctgacccta 19020 
ctttagagct ttatatctat tgtcacattt aatcttcatg acaattctga tggcataggt 19080 
tttattgact gattgattga ttgagacaga ctctcgatcc atcacccagg ctggagtaca 19140
gtggcccgat cttagctcac tacaacctct gcctcccggg ttcaagcgat tctcctgctt 19200 
cagcctccca agtagctgag actacaggca ccagctgcca catcctgcta atttttgtat 19260 
ttttagtaga gatggggttt caccatgttg gccaggcttg tctcgatctc ccgatctcaa 19320 
gtgatccccc tgcctcggcc ttgtaggtat tatttatgac ttcacttcac acatagggaa 19380
acggagactt agactggtta tgacacttgg aaaaggtcat ggagctagtt agtagtagtg 19440
tcaggattta aacctaggca gtctgacagc caagtctgcc ccctttcatg aagaggtttc 19500
aaaagaagat cctgtctgca ttcattgaat gggaatgggg tcctggcaaa ccctttcatc 19560
agcattattt gaagggtgcc cctggtcaat aaaactgtcc tgtaaaaaaa acacatgtcc 19620
ttctgcttag gtccatggga gcagtgaagc agaaagggag tggaggagct ctcaccctga 19680
gcctcttctc tcttcagttc gttcacacag tctctcattt gttcattcat gcaacacacg 19740
tttagcaagt atttcccatg agttaggctc tgcccagcct atccatagga aactgtgttg 19800
tggagagacc gctgagtgtc aggaatcctg tctatgaagt ctgtctctcc acttgccatt 19860
tctgagcttg tttctgggga caatgatgct tacgagttga cagaaatcac ggctatggaa 19920
acacctaaca tgatgcctga catagagggg ttaatttttt agaggcagaa tcacttctac 19980
atgggaaagt tgtaccagaa acaaagcagg gtggtgctga attttcccag aggtgagatg 20040
ggcctgaagg tgcttggctc ctaccaactc tcagggttgg ctctctcaaa gcagggaaac 20100
ccaaggctgc ccatggcatg cccctactta tctcttagag taagcagctt cctctttcca 20160
gatggtttac ttcttgtggt agaaaaagag acccagccct ctgcaaagca agaatttggt 20220
tccaaaacca atcttctggg ctgacaaagt ctctagatac agacagacac tgctgaggca 20280
acattattat tattatattg agacaatgta tgcccagggt tgcaagggaa ggtggttcca 20340
cagaacagac attaaataaa ttatttgggt aagggaggag agtacaaaca attttaagga 20400
aatctggcca tttccatttc tctctcttgc tttctcctct cttatcctct ttattctctc 20460
agctctgatt taagaaggag aatgagttac tgagtaaata gtggaggaag tagtttgtgt 20520
gtggatactt tagctgactt ttggcttaat attaaaggac ccccttggcc ccactgctga 20580
acattttctg caacttggga aggccagatt gcctgggcct gtgcctgtca gggagagttc 20640
atggctacac ggtgcagtgc actgtgtata ggagtgcaca tgggtgcctt gtggagagag 20700
gccaagggga aatgtcacaa agcacaaagc agaggacagg tttgagcttt tgggggaaag 20760
cagtaatgtg acgggggcag ctggcctgga agccaaagtt tctgagatct aatctaaacc 20820
ctgcaccaaa tcacggatca ccttgagcct gtcatttcac ccttctgtgg ccttcaaaaa 20880
gaggagactg aattagatct ttaatatctg gtcatttcta acgttatata acttggtggt 20940
tttgcatgag atgttgggga gtatggggga ttctagccct tcttgttcct ttgccccatg 21000
caagaatctc cacttgaatt tcagtctttt tccctttcct caccaaatca atatatttca 21060
ggcaccaaat agaagccaca aggcacttct aaattgtcga aaattacagt caaggctact 21120
tccatctctg ctcatggtag tgctaggcca agctataggg cagggttgaa gagggttggt 21180
ggcaggagtc aggaagtggt ggagcaaaga agatttaacc ccttaaaaaa aaaaaaaacc 21240 
accatattca tttcatccat gatttgtcta gtggctgaaa tgatcccctt actaggatgg 21300 
agacaacttc accgtaagtc acagcaaagt ttcatttggg aggaagtatt gatcacaact 21360 
actcactgtt acttaatatt taaggatcca gtttcttctt acttgactgg agcagccagt 21420 
tgggggatag gcctacgtgg ctttgtctct tgacttgatg atgaacttgt ttgactttgg 21480 
gaaaaaggcc tatgccatcg ccaagctctg tggaatgcag gctggagaaa gtttagtttc 21540 
ttcctacaaa agtctccagg aggtcaggaa cagtggctca cacctgtaat cccagcattt 21600
tgggaggctg aggcaggagg atcacttgaa cccaagtttc aagaccagtc taggcaataa 21660 
agtgagaccc ccatctctac acaataaaaa cttaggtatg gtggaatgca cctgtagtcc 21720 
cagctactct tgaagccgaa gtgggaggat tgcttgagtc cggaaggtca aggctacagt 21780 
gagcctgtgg ctgtaccact gtgctccagc ctgggtgaca gaatgagacc ctgtctcaaa 21840 
aaaaaaaaga gtctccaggg ggcctttgat tcctaggaaa gcaggatgct tatcctttct 21900
ctactctcta ctcatctgct gagcccaaag ggagctgcat ggcagaatca aggctcatga 21960 
acagatcatt ctatacttcc tattgcccca ttcttcagat gaagagaatg accttgattc 22020 
ctattgggta tcaggagact tcagttgatc tctggaagat gatactctga acctgcaggt 22080 
gagtagaggt ctgagtagat gcagagcagt ggagaggcca aagactgaat atgaaactgg 22140 
acatgagaca cacatctaaa tattttcaga acactaacta caaaaagaag ccttgcatta 22200
tttctgctcc agtctatctt tttggttggc ttgattttca ccaacttgca aacatggtat 22260 
tagttatgtt tgggaaagtg gagttggaat gagagtgaaa gtataaaatg tgaggcagag 22320 
aagacagaga tgtatagttt ttagtcttct ttttcccatc tgtaaaatga atctaagaaa 22380
gaccgataat agtagatcac attcaaacta ggagttaaag acactgaaat tgattgaata 22440 
attgatatgt attccatcaa cttgaaacaa aggattttcc tgatctctga taatttggac 22500
acacatgagg gccagaaact tctctttcat agtaactaga atgactttcc agtcttcata 22560 
gggtcaaaag aatctttttg gggtaaagaa atggggaggg acctatggaa ccattagaat 22620 
ctatggtgaa ctgagctcta cactaggatt attattatta ttattttgag acagagtctt 22680 
gctcttatca accaggctgc agtgcaatgg tgcgatctca gctcactgca acctccacct 22740 
cctgggttca agcgattctc ctccctcagt ctcccgagta gctgggatta caggcgcccg 22800
ccaccacgcc cagctaattt ttgtattttt agtagagaca gtggtttcac catgctgacc 22860 
agctggtctc gaactcctga cctcagataa tccgcctgtc tcgacttccc aaagtgctgg 22920 
gattacaggc gtgagccacc gtgcccggcc tacaccagga tttaattaat gcaattaaca 22980 
tgagctaaga ctccccaggc cctggtcttc ttagcactga ttaacagctg ctccaggatt 23040
tggtctctgg tctgaggctt atctctttag tttagccttc tctgggaatc ttctgactgc 23100
tgctggttgc tactttatct tgaacagaat ctcagcaaac atttagccag aatcagagcc 23160
atcagaaaag agccacttca aattctctac ctccagagct caggatgagc tgagctccct 23220
ggccctcact tccagtttcc tcctgggtgt ttctctgcag tgctaaaatg ttcatcccta 23280
ctcagtgaac tttctaaata gtcctcttct actttatcct accatttcat atacctggtt 23340
ctgtacacgg aaaataattt ggtctcccaa gtgcttttgc ttaaggtgat gcaactgagc 23400
agacttcctg gatctatgcc atggcttgtc tgaaaatgtc aagtcctcag catcctattt 23460
gtctttcctg aaactggccc ataattgagg ctgattgtac tggatttgca gttgtgaaac 23520
agcacctgcc tttgactcag cagtgggcac actgctgtgg aaagtgaggg gattatagaa 23580
cacgtgggat ttaatgtgtg gttagaaagg caatgtaaag gttacagaaa gaaactgaca 23640
atgggcagga aagaaaagtc agagttctca aagcaacact ctcttagtgt ctgacgatga 23700
tgttgctgac agtgcaggag tgggtggcag caaaagctca tgtgtgtgac ctgctatgtg 23760
gtttggagtt gcctattgtc tgattagtat ctgagatgac tctttaggag tgcctgtcat 23820
tgattagaat ctgtgatttg gggccatatg cgacataact ctgcaaaatg aatacctccc 23880
ctaccttgta caaagagaag tggctcaatt tggcatctta cctagggatt ccaattattc 23940
tgggcaagtg actggctaga agcacttcat tagcacagta aatgtagaga cctcagccac 24000
aaaacaaaga caagatctac ctgtactcag gataaaatgg actagtcatc acactccaat 24060
ttctttggcc acacactctt ggataataat tgttagcaac agggatatca tcttcagatt 24120
atcaaggaga gttctgtgtg tgtgtgtgtg tctgcatgca cacgtgtata ctcagataat 24180
gaacactaca gaagtgcaaa tttaggctag tacatcattt gtaatgttaa atttataatt 24240
ttggcatata acatttgcca actataacta tgtgattggt gttagtaatg caaattcaga 24300
tttgtgcaca attctatgta gtaatcaacc aaactgttct ttactttagt gtccagtgtc 24360
ctattacaac aaacattaaa tttattcata gtaaaaccaa aaaagaaaaa cagaagaaaa 24420
gacaggagaa cagacagaaa gatagactga aggaaagata aagtgaaaag attgaagggg 24480
gaagaaggaa ggaaggaagg atcctaaata attggaaacc actcagagaa gcaccacagc 24540
taaggatgct gttaatctcg cctaactctt ctagtgagga aggtgcttat taacagttgc 24600
agacaaatat gtatgcatca tgagataatc atgagattat cttttcaggg ttcttgggca 24660
tgcgtgttct gtagggttgg attattgaag gataagcttc ttgaaagggt gtctagcagg 24720
tgatgacctg tgtgatgtga ctagcagaaa gattctggga tagataaaaa tgaagctcat 24780
tactgcctgc tgaaaaaaat agtctatttt ctaggccata taataaatgc cctactcata 24840
tgctgtgaga ggcactcagg ccctggggca ttgctgatga ctcaggcaga agccttcttt 24900
gggaaggaat tacattagtg acccactaga atctaagcta catgagggca tggactccat 24960
ctgctttttt cactgttgta tctccagagc ctagaatatc tggcatctgg catatgattt 25020
gcactcaata aatatttgtt gaatgcatgc atgaattagc gggagtgtga tccattccac 25080
acagtaaata ttcatttttc caagttctgt tgcttttatg ttatgtaagg gcttcatgcc 25140
ttacatagca tgcattatca caatccccaa atgagtaaaa taagatattc atgaacatga 25200
tcacaagcaa aaaccaattt acctttgagg agaggcatga catggtccta tcatccccta 25260
acaagaatgt tgcagagtta caagagccaa gagacaagga aagtcaatgc aaactctgtg 25320
gaaaccttaa tgaaactctt ctgcacagct agtgaattgc tctcaagtgg ctgatgtgat 25380
gagtaacgat tctgaaggtc cattcagaag gagagaaatg atgcatatct ggatgctagt 25440
cactacacct tccctagctg gagcttcacc actctgttgt tccctggatt cttaacagag 25500
agcccagcaa cagaggcagg gccattgttc actggtgcca tcctggtgct cacatctggt 25560
ttgtctttgg agactgagag gaattcggtt aacctctcca ctggcctccc caagttcaca 25620
tacttcacgt ttttgtctgc aaaactcata cctgaaatat tctattaaac aaaagaatga 25680
agccaaatct ttgaagcatc agacttgaac cagtcactgt ccaaaagtca agagagcact 25740
gaatgtgtct atcatatctg caattttgtt cttccttact ccacctgagt ctaggactgt 25800
gaaacctcca ccaccactac tgcccccaca cactgggcac catcccctta attcccaagg 25860
gtggaagagc ttcttgcctc tgatgccagt gctcctcagc accagaagtt gagcctccgc 25920
tttcaagtaa attgtcatat ctttggtagt tgaaaataac cctgaaaaga aaaattcaca 25980
cgcagcaaaa agatgcaggc ttgctcctta gagatccaac tgaaacgcca cggtcattgt 26040
cagcagattt ttaaaaaggt gtctgtcttt ttttcaaccc ccctccaact acttagccta 26100
gtgcttctga agagtagcta aagctctgag agtttcctca gattggggaa cacagcctgc 26160
ctttcaaggt acctctcagc tctctctcta cttcaataag tgagataggc aatccggggg 26220
ccataccatc tgggtaattg aagttgagtc cagaaaggta tgccaaatgg ttctccaact 26280
ttgcccccca aaatgtagcc ctagaagtca ctgaatttac caaacaatac ccatcccaat 26340
gcagacctgc tgctcctcag actgaggttt caggaagcct gcttatcagc agggcacagt 26400
tgcaattttc cctgcatagg gttgaaacgt ggcagactct tttctggaat gaaacactta 26460
cgaaaatgtg cttaatccat ttttagcagt ttggtagttt tttcatgaca aaataatatc 26520
tttcattctg aaaaacattt tgcctattgt agcttctata caacttcagc actgcaataa 26580
agccagttct cgggtgagta tggcagccag ttgcaaaaca ggaaaagggt gcgatacatt 26640
ttggaacatg atgatgcata tactccatag cttagtgatg agaagagaaa aaggttatgg 26700
ggtgaggctc aacacaacgt tgtactcttc ccacagcaga aaatgctgag gtcatctttg 26760
gaaagtgttg cttcctgttg ttacagggaa ttcttccaat tcccacctct cctgctgttt 26820
gtgaacgcct agtgcagctt acccagttct ttacctccct ttctcccttc tataccccaa 26880
gagcctgctt ccagcaccct aggaaagctg tatctcttca ccatgtcagc caaacatttg 26940
gtcatagcta cagcctctca caagaaggga tctttcctct catgaagctg atcctgctat 27000
gatctcagca ctcagagatc caatctggca cacaggaggt gggaccgaga caatagacag 27060
gtccccaggg gcctatgcct tccattagct aagaattcca gatgaccgag tgtgaggccg 27120
cagacctaag ggacaaggga ataaaaactg ctttctcatt tttccttatt atcattacgt 27180
catgctggcc tctctctgtg tgtcaaactt cttgtttgac agagagagat gcagaggcac 27240
tgagcttgag gaaatattcc tgcccaggat gcatctttta acaaatgttc actaattgcc 27300
tatagtgtgc tggggatttg gtggatgatg tgtctctgct catcagggag cttgcgatag 27360
taggggagac aggcaagtaa tctggtaatg tgacagtgtg acagagtgct gcaaaggtga 27420
gggtcctgtg ggaacacatt ggacagtctc ctatcctata cccaagggac atgggggcag 27480
ctccagaagt gttccttaca aaatgacaac gagactcagt ccagaagggt aagagtggag 27540
ggaagagtac tttggataaa gggaacaaca tacaaaaaca ccaggaagct ggagagacag 27600
aacaggaaac tctagatgca tcagtgggac aagcagagtg accaaccttg ggctgcagcc 27660
tgagagcaat ggggaactgt tgaagcagga gaagtgacat gttgagtttt gccacaggaa 27720
gtgagttaca cagaggagac aagagccaca cccctccccg ccagaccccc agcaaggagc 27780
tctgtcctgg gaggttctgt aaggctcagt ttctgcccat ggagtgggga tcatgatgct 27840
taaaggaaac aaggagggag ggtgagcaga tgaaagggca gctaaagatg ttttaggtta 27900
ttttacattt actgaccata taggcattat ttgagacttg taatgttctc agtcccaact 27960
ccaaaagccc agcacccctg tctatatttt agtgtgcatt taaaattctc ttcttaccct 28020
ttcatttatg gtttgaaaat ttctcttcct taataataac acagccaaaa gggagctgca 28080
gcacaggttc cactcacggc tacctcatta ttaatccttc catatagcca ataggccatg 28140
tggaggagct tccaatcccc tcaaaccctc tctggcaagg ttgaggaggc cacagtcagc 28200
tccaggattg aacctcagaa ggtggcacag acccctcccc acagcatgca atcagaactc 28260
cgcagagctg gcgggagagt tccccctgag ggagaaatga aaacgcttgg cttctcactg 28320
ttgtcgtttt ggaggagagt gtgagagagt gaagatacag gggaggtgga aagagaaaac 28380
cgggaatgga gaggaagtga ggaagaaaga agcaaggcag acaggcacag aaagtgagac 28440
gcagagaaga aactgcgagg aagattggag atggagatgg acagagggtc cgtgaaggaa 28500
aatttcaggg aggggggagg aagagaggag tagagtgaaa aaaagaagag agagcaagag 28560 
gaaatggggg aggggagaag tctcagaggt gtttagataa cacactggaa cctttgagga 28620 
agagtcaggc aggagaaaat ggagctgcca aagcaccttg ttgctgtaag caaatgtgca 28680 
ggaggcaagg gaagaggcca gcccacccct ctgcctctgt cctccatcca gggctgaggg 28740 
ccactaggtc actgctcttg acccgctgcc attcctctga agcctgcaaa gcctactgtg 28800 
tcccagcctt gctctgcatc tctgtccctt cccatgagcc aggctgggga aggagtgtct 28860 
gggaggagga attctgagag ctatagagga ccccaagaga cgacctgagg gaggaaagca 28920
ggcacgggaa tggctgcaca actaactaca agggcgaggt gggatggggg caggtagaca 28980 
gatgaaggga cagacagact gtcaagcagg cagaccgcac ttgaggaggg ttcccaagtt 29040 
cctctgtatc atgcaccttc tgtgtctctg tccagagagt tgggggaaag ctattcctgc 29100 
cctcagcctc ttcaccttct ctctaacctg aagctgtcag cttgagaata cctggtggcg 29160 
ctgcctaaac cctctcaggt ctttttcccc ctggtaacaa tcattaggac tttactgttg 29220
ggctccagct tggctccagg atgaaaggaa agctggtcgg gagtgggagg tggggggcgg 29280 
ggggagcagg gctgcttaca tgccctcctg cagctgggga gttagaggtt ggacagaaca 29340 
cctctggccc tccccaaaca gccatggcag cccaagctcc tccctggcac acgggcaggc 29400 
tctgaacccc actccagctg ctgccgactg tgcctttaaa ttactgcttt cggagggtgc 29460 
atctaggggg aggtgggagg caggaaaaca tcttcctacc ggtctccctc ctccctcccc 29520
tccaagactc ttcagggttt ggagagtgat tgctgcccag agagaatctt ttccagctcc 29580 
cagctggcag gctaaggccc tccggagccc aaggcgagcc caagcagaag ccagtagggt 29640 
tatctgtgtc aggatcattt ccaggggaat agttctggcc cctggcaggt aaagacaggc 29700 
cagaggagaa gaggcagaag aggagagaaa gcaggctctt ttgcgagcag cccaggttgg 29760 
agaaaggctc tgtacttttg gcgttcctgc agggatatcc cctctcacat tggcagccag 29820
gctgagaaag ggcttcaaga tccccgcaga atgacaactc ttgttcctgc aaccctctcc 29880 
ttccttcttc tctggaccct gccagggcag gtcctcctca ggtgagctga gtgggggtga 29940 
gaaggaggtg catgcaaaca ctagccaagg gggatgggca acctgggcat ctggcacact 30000 
ttcctctaag tttgcacaaa gcccgcagtc tggagggctt caggggagga cacgacctcc 30060 
cccagttagg accccaagag ctttttatca gagatgtcaa ggcaccggtc atgctgggaa 30120
tgtttctgca caggtggaag cttcctaggc ctgggtaggg gctaggcaac taagccccag 30180 
aatggagggt gtctgcccag ccacctaacc ctgtttccct atcatctggg ccacacactg 30240 
ctcagatcgc taaaaatgat ggcaacatct gtcaactgcc agcccaagag gcttcaggta 30300
gcaggggttt gccccaagac cacatcagga tggagctggg ctagacattc tttggccttc 30360
tgcagtcacc ctgcctgtgc tcacttggtt gctccctatt ccttgagggg gaaaaagtag 30420
cagtcacttc ctgggcttgc aggaaagaca ggctttgggc tccctgtaaa tgttccccca 30480
ccaattgccc ctttctcatt agaccaggag ccagcctagg ttggcagggt gagctgaaca 30540
ggaaagatgg ctgtagcctg ggctgacctg atcgtaagag gagtgcctcc caaggtggag 30600
ggggtcagta ccactgcggc atactcctcc tcctcctcct gttaatccaa gaaacatcgc 30660
agggctgctt gcgtctgcaa agcctcctgc cccactgttt gcagacacat tggctggcag 30720
gcaggcaggg atgcagagca gggcaataaa cttctaaaga gtgaaaagca gctcaaagta 30780
gaatggcagc acagctggag ttgacaggag gatgcaggca gaagggggca ccacactggg 30840
ccttgcaggc cctccaggag gggaggctgc ctcaaagctc aagctccagt agcagcattc 30900
ccactggggc ttggaaagca gaggcagtta gagcaaaacg ggcccttaga gagtctggag 30960
tccaagtcct tcgttgtaca tacggaaact gaggcctaga gatggggaat gacctgccca 31020
gggtggctca gctacctagc accagagtta gacttctctc cctgcgccac cccaagactt 31080
tacatgagaa agactgtcag ctctggatcc tcctaggtcc tgaaggtgtt taggactttc 31140
caggaatgaa agagaggaag gagtggctgc acacggtgac tctgctctca ggtcctcaga 31200
caagttagcc ccagaagagg gtactgcaac aggggctggg gaccaagact gctaagccca 31260
tagcctgtcc caggatacta ggtctaggct tagcaggaga ggtgacctga gagtggggct 31320
gaggacggcg gggagggtcc agtttctcca cgtgcaggac catgtggcct cccagcaggc 31380
aggtggcagg tggccagaag cagcagcagc acctggctag cgtggacatc ctgtggtgaa 31440
gtcacagact tctgcttgct ccagcaccac caccagcctc cttggatttt cagagcctgc 31500
ctctgtcccc agggtaaggg cttcctcttc cttagaagat tcaaggaggc aggactctag 31560
acggtcaagg tggggctgcc actgaccttc ccctactttg gctccttggc agcccaccct 31620
gggctgcctg caccatagta ttgtgataga tagagctagt tttgaagtca gtagctcttg 31680
atttgaatcc cggctttacc actcactggc tgcatgatcg tgaagaattc acctcacctt 31740
gctgatcctc agtcttctca tttatataat gggggtaatc ataatactta gttgagagga 31800
ctaatgtgag aactaaacta gacagagtgt gtaaagcatt tagcatgggg ccttagctat 31860
ttttattatt ctatcggtca ctggatagta catggatcta ctgggaaaga ggagcccact 31920
ggctagaaga agactagtcc ccagcaggac agcagttacc ctgctgccca gtcccacagt 31980
gcggcagcgg cagtgacagt ggtggagtcg gctatagagc aatgacagca ggcaggaggg 32040
agcaggaggt gcctggccca gcctggagtc cagctgagct cagccctgag cactgctcct 32100
gtccccaagg ctttgctaag ctcctagtcc ctagtttatc tgttagctgg caagttggta 32160
agtaggcata tgaaaggggc tttgctgcta ttagagggaa acccgttagg ggaaaagggg 32220 
gctggcttct aagctatgcc aaagcacact ggctccttcc tcaaaatcaa gctgttagcg 32280 
gggaaaatac gggagttgac ctcagcatgt ctaccctacc aggaggcttt ttcccatccc 32340 
tagagtcctg cttccccacc atagttgcca aaactagttc acctgcctgc agtggcatgg 32400 
tccttaacta aaacgtagct caaacatagc ttcctgttgg agcacacacc agaagccaaa 32460 
actccctctt cagaagctgg gtcctacttc ctgagcagcc cagcataaag cccaagcaac 32520 
tgaagccaac tctcagaaat acaagttgtc cttgagccct tcccatcctt gactaggcta 32580
gacgggcaaa actttgtagt agccacatca tttcccgcaa gacatcactc attgagtgtt 32640 
ttccaagtac cttcagaaag agaacactgg acgaggcctc agcctagggt ttgagtcctg 32700 
accctgtcac tagctctgag actttgaaca agtcagtcct ccttttgaag cctcagtttt 32760 
ctcatctgta aaatgggaat aacagtcctc actcccatct cactgagaat tgggagggta 32820 
agaagaccta ctgtctgtga aaacacttta ctagtactaa ctggcaggct gccgtgttga 32880
cttatggcgt ttttgttctc attttgcaaa tggcacccag agacccattg tcttctatcc 32940 
tcctgcctga ggggtatgga aactgagagt ctgtgataag tgcagcacat aactggcctc 33000 
agactttttt acctctcctg tccagggtgg ccttggcaaa agaggaagtc aaatctggaa 33060 
ccaaggggtc ccagcccatg tccccctctg atttcctaga caaacttatg gggcgaacat 33120 
ctggatatga tgccaggatt cggcccaatt ttaaaggtaa gaaatcttca tcctataaaa 33180
actcctccct ccccaccacc tttggcagca cactagccag ccagccccta ttgccttccc 33240 
taaggaaagg aggctggagg tccagggctg gggctggttc taacagctca gaaggagcct 33300 
cctcacccca ttgtccatgg gcctgtgtat ctggcactga tttctcccag cagtgccctg 33360 
gcctgcagct cagggaagga gttctggaag ctgctcccag cctcctggaa tgccctgcca 33420 
atggcctgtg ctttgctacc ctcaggccca cccgtgaacg tgacctgcaa catcttcatc 33480
aacagtttca gctccatcac caagaccaca atggtaaggg atctccctgc tccccacttc 33540 
cagcctagtg tgagtgggag gagcccacca gataacaagc tggcatcttt tgccatatca 33600
gcccaaggaa ggttccttct ccagtgaaaa atgcctactg cccctgagat gtgttcccaa 33660 
cattcctcct catggccctc ctgcccctac aggcttgctg ggcccctggg aatggcaatg 33720
tttctgaagg gcccatatct gcaccctccc aggactaccg ggtgaatgtc ttcttgcggc 33780
aacagtggaa tgacccacgc ctgtcctacc gagaatatcc tgatgactct ctggacctcg 33840 
atccctccat gctggactct atctggaagc cagacctctt ctttgctaat gagaaagggg 33900 
ccaacttcca tgaggtgacc acggacaaca agttactgcg catcttcaag aatgggaatg 33960 
tgctgtacag catcaggtgc accgggtgga tagccaggag agtttgcgtt aaggggaaga 34020
atttggatga agactgaggg gtgggaggag ggttcccttg cactgtgatg ggccccagta 34080
agccgatgtc acttctttct tactgtcccc actaggctga ccctcatttt gtcctgcctg 34140
atggacctca agaacttccc catggacatc cagacctgca cgatgcagct tgagagctgt 34200
aagtgtctat atagagtcac agagagcctg gaataacagc cagttaaccc gacctcatga 34260
catcacagac aggaaaaggg ccactcagag ttgggaggaa cttactcaag cacacccagc 34320
caattcttgg cacagatgag actagaaacc aggggtccct aacccctagc ccagtgtgtt 34380
tttcatcccc acgttatttg ctgatctggc tgttccatgg ggctggggaa cgggaaggca 34440
tcagccttac gcacaaatgt gtttgtgctg ttttgggggg cgatttgagc gacatcttct 34500
ctgcccaaac gtatgtcccc tctcacagtg cctgaagtag ttccaccttt acctctcttg 34560
aattaagaca ttgaagatgc ctataaaaat gccagagtgg gcagggtggg ggcactcgtg 34620
gagccctgat tccccacaaa catagataac ctttgtagag aggtaggaga tgacagttct 34680
aggcacaaga cagtgcctag aatgaagagg tcagccaagg gctcctctgt ggcaggggtg 34740
ccagcctgga ggggctgccc aaagtgcaga ggaaccaagg atgacaaggt tgggtcatca 34800
cttatttcca ctccctcctc atctcctggg tcaggagaga ctgttagatc tggagccatc 34860
cactagatgt tcacagaaga aggcttataa ggctgtttcc agcataaaat aactgtgatg 34920
aatagaatgt gtcctatgtg gtcagctctt tagagttctt aaaccacttt cacatccagg 34980
ctctcaatat cccttcatgg caacgctgaa tagtggggag agcagctact ataattccca 35040
ttttacagaa ggggaagcga cttacccaaa gtcacagagc tactctgggt caaggcccaa 35100
cactggatgt ctgtgagcct tggtgtttct gctcccatct ctacctgaca tgtgcaaata 35160
tggtcatata cgtggcatga gccctcatga actgctatgt atgtgtgaat aacttgacat 35220
gttcagacat tagctttaaa tttgccccaa ctcaaggccc tggctaggat ctatggtatg 35280
cacattgtcc catgcatgac tggcaggata taaatagctg tctcctccct gaggtgatgt 35340
ggcagaagtc ctggctctga ggctagctca gcttaagaca tgtgttcggt acacatgctt 35400
actcccctcg gcttgagata tggcaggagg ctcccagctg tgttccttga catctttcca 35460
accctcttga cctgctttct aggtctgtca gaataatttc tctttgggct tttctcaagc 35520
ttgagaacag ggtaggagag gagtctgtaa tcttcctgat ttacctgccc actcaggatg 35580
ggcccatgag acaactgacc aagtgtcctc tccagaaaag ttttcctttc cataatgagc 35640
atgaagctcc taaaccccag accttactcc tcacctgttc tctcctgcct gttctgatcc 35700
tacccctacc cccacccagc atccatactc tgcagccctc tgccatctct gtcactttca 35760
gttggctaca ccatgaaaga cctcgtgttt gagtggctgg aagatgctcc tgctgtccaa 35820
gtggctgagg ggctgactct gccccagttt atcttgcggg atgagaagga tctaggctgt 35880
tgtaccaagc actacaacac aggtaaaaac caggatcttt attggctgtg agccacagga 35940
gaaccttcag atgtgaggca aggggacaag gcagctggcc tctatcgaag gttcaatcta 36000
tgtcagatat ggggccaggt gcttcatata acttatttat ttgtgacagc aactataaat 36060
accatttttc agaaagggaa aactgaggtt ctaagaagtt agttaaatca agtgtccaag 36120
tcgcacagct agtaaatggc agatttagaa tttaaatcca agccttctaa ctccaaagtc 36180
tgtgttcttg ttcctgtcgc atgctgcaga gcaaatgaca gtgggtggac agtttcccac 36240
ttttcctggg gtcaactttg tgtggaagcg gggaatccac atctggtaat agggagtgac 36300
tatgctgaac ccagagggaa caagcctgag ggcagtggcc agcttacagt ccacacctcc 36360
cactgtcgct accactgcca gctctcagga acttcccagg tgtgaaatgt cagctaagaa 36420
gtggagaagg gctgctttcc caggctgagc atctgccccc ctgcagcttc tgtaagagtt 36480
gctggctcca cattttccct gacaatttgt ctgcgcggag ctctgaccct gaaatagagg 36540
cagcagcctg aagaggggag atcccgggtc ccttccatgg tgggctgata tgttttgggt 36600
gctgccccat gaatatgtgt gtgcatgtat gtgggggtgt ggggagtggg aagaggaaaa 36660
ggccattgaa tttagaaaat gcaaggggga atgaaaattg atataaactt tttggagtaa 36720
aaaagttggc aatttgtata tagatcagtg gttttcaaac tttggtgtgc actatattcc 36780
cctggggagt ctattaaaat gttgcctcct gggctctacc tctcagagat tctgattatg 36840
gttacctgta cttggagtga tgcctagaaa tctgcatttt aacaagcatc ctgaggattg 36900
tggtgcaggt ggtccagcaa acatgctttg agaaacttta gtctagaagc taaaattgtg 36960
tttctactta tttatccagg aatcctactc cttgaaatgt attatatgga aatattccaa 37020
aatactgcat tggccttaga ccccaagggg gctcctgggt ggctgtttat attagcaaaa 37080
acttggaact gttttaaata tccaccaata gggcagaggt taagaaaatt acatgtgcgt 37140
ctgcttggtg gactagtatg cagacattaa aaggaatggt tacaaggtgc ttatatcaat 37200
gtgggaaagt aggagtgaaa aaaggtaaga ttcggaggta taaatacaga atgattatat 37260
tatggtaaaa gaaacaaacc aaaagcatgc atttaaaaac cttaggtgga aatagcacta 37320
aatgtctcta aagtaatgtg atacaacaga taagagcaag aactttggag gcagatagag 37380
ctgagtttgg attctggctc catcatttac tagttgagcg cccttgagaa atataattta 37440
atctcttcat gcctccgtat tcctatctgt aatatgagag ataataattt tacttgcccc 37500
aaagagttgc tataggatta aatgaggtaa agtcaggtcc cttagtaagg gtttgataaa 37560
tgtaagctta tgaatattat tattccttct ctacttctta ttttttctat actaaacaag 37620
ttttaatttt ggaattaaaa acaaaactgc
agaaaggaaa gaaagtatag gtacctaata
ggatgaagtg aggtggattc agggaacagt
agccataatc agtcattgac aacatcactg
ctagacctag gccagaataa aataacaata
cctgagcctc tgcttccaag agaagctctg
gaaagcagag taaagtgaag tctggcaaag
caatgcactt tttgttggta gaatagaatg
ccccgctgtc tggagctggt ctctgttggg
ttctcacttg cttaagacac aagcagtgtt
ttctggggca ctagcaagca caagcatggc
tgttttgtta ataatgataa caatagtaac
attatcttgt ttaatttttc caatagccca
acaggcaagg aaactgaagc tcaaaacgct
aggttcagga ttcaaaccca ggcagtctga
gctgtgcagt cacgtgaaag gccttccgct
agtctttgat ggtcgctctg agagcttcct
attatgaaag gctcattcat gggcccagct
tgccctaatc ctgtcctaat cccctcaggc
cacctcacct ctctccttgc agggaaattc
cggcagatgg gctactatct gattcagatg
tcctgggtct ccttctggat caacatggat
accaccgtgc tcaccatgac cacccagagc
aagagacatg caagggaact tgcttgccat
tgccctcctg ctcaccattt accttggatt
tggtgtcctg cattctctct ccctgtaata
ggctggggtt agggatacta tctggctagg
tctcttcagg aaatactagc atgtccaagg
actgcctcat aatacaaatg aggagaatac
atcatgcctt ataagaaaca accaaaacaa
acggcaagaa
37740
37800
cctgataaga
37860
cttggattat
37920
cgttagtatt
ttcttagccc
37980
38040
ggaacctgct
38100
cactgttttc agatgtttgc tatccggctc 38160
tatcccagtg aattgttgtt gctagagagc 38220
ccctgagatc 38280
agctacctat gtgtcaaaca cctcaaatac 38340
gtgatgcagt tggtatcact actctgtttt 38400
tgggtgtttt gcccaaagtc acacagtgga 38460
ctctggagcc ctcctgctcc ttatcctctt 38520
ggaaatgaaa atactaaaaa ccaaaggcca 38580
38700


tcagctacag 
taaoacc taa


38760 


acctgcatcg aggtaaagtt tcacctggaa 38820
tacatcccca gcctactcat cgtcatcctg 38880
gctgcccctg cccgtgtggg cctgggcatc 38940
tctggctccc gggcctcttt gcctaaggtg 39000
agagtacccc atttcccatt catcccaatg 39060
taaggttgag aagacccact gggcaaaagc
tactgctaat aacattgttg acaatgtgga 39180
gaatgaggag agacagtgcc ttactttgac 39240


tcaaaagatc 


ctcagcaagt 